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OM protein - protein search, using sw model 



Run on: 



February 28, 2004, 08:56:55 ; Search time 59.4242 Seconds 

(without alignments) 
2462.961 Million cell updates/sec 



Title: 

Perfect score 
Sequence : 



US-09-668-314C-2 
2687 

1 MGALARALLLPLLAQWLLRA, 



RPRDPEWNDESSLVRHRWK 518 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 1586107 seqs, 282547505 residues 

Total number of hits satisfying chosen parameters 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



1586107 



Database : 



A_Genes 

1: gene 

2 : gene 

3: gene 

4 : gene 

5 : gene 

6: gene 

7 : gene 

8 : gene 



eq_2 9Jan04:* 
seqpl980s : * 
seqpl990s : * 
seqp2000s : * 
seqp2001s : * 
seqp2002s : * 
seqp2003as : * 
seqp2003bs : * 
seqp2 004 s : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
AAW61362 

ID AAW61362 standard; protein; 518 AA. 
XX 

AC AAW61362; 
XX 

DT 25-MAR-2003 (revised) 

DT 25-SEP-1998 (first entry) 

XX 

DE Aspartic proteinase ASP1. 
XX 

KW ASP1; Aspartic proteinase; Alzheimer's disease; cancer; melanoma. 
XX 

OS Homo sapiens. 
XX 

PN EP848062-A2. 



XX 

PD 17-JUN-1998 . 
XX 

PF 01-DEC-1997; 97EP-0030964 8 . 
XX 

PR 14-DEC-1996; 96GB-00026022 . 
PR 06-OCT-1997; 97US-00999723 . 
XX 

PA (SMIK ) SMITHKLINE BEECHAM CORP. 
PA (SMIK ) SMITHKLINE BEECHAM PLC. 
XX 

PI Powell DJ, Southan C, Chapman CG, Evans JR; 
XX 

DR WPI; 1998-314477/28. 
DR N-PSDB; AAV27962. 
XX 

PT New isolated polynucleotide encodes Aspartic protease polypeptide - used 
PT to diagnosis, treat and vaccinate against Alzheimer f s disease, cancer and 
PT melanoma. 
XX 

PS Claim 11; Page 7; 19pp; English. 
XX 

CC The human ASP1 protein is structurally related to other proteins of the 
CC Aspartic proteinase family. ASP 1 polypeptides and polynucleotides can be 
CC used to diagnosis, treat and vaccinate against Alzheimer's disease, 
CC cancer and melanoma. (Updated on 25-MAR-2003 to correct PR field.) 
XX 

SQ Sequence 518 AA; 

Query Match 100.0%; Score 2687; DB 2; Length 518; 

Best Local Similarity 100.0%; Pred. No. 8.6e-231; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MG/VLARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MGALARAL L L P L LAQW LLRAAP E LAPAP FT L P L RVAAATN RWAP T P G P GT PAERHADGL 60 

Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 12 0 

I I I I I I I I I M I I I I I I I I I I I I I I I I I II II II II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 12 0 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 18 0 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I 
Db 121 TPHSYIDTYFDTERSSTYRSKGFDWVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II 

Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 



Qy 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

M I I I I I I I I I I I II I I I I I I I I I I I I I I I I | I I | I I | | M i | | | | | | | | | | | | I I I I I I 
Db 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

Qy 421 RAQKRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS N CVP AQ S L S E P I LW I VS YALMS VC G 480 

I I I I I I M I I I I I I I I I I M I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 RAQKRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS N C VP AQ S L S E P I LW I VS YALMS VC G 48 0 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I II I I 

Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 



RESULT 2 
AAY13799 

ID AAY13799 standard; protein; 518 AA. 
XX 

AC AAY13799; 
XX 

DT 21-SEP-1999 (first entry) 
XX 

DE Human aspartyl protease, CSP56. 
XX 

KW CSP56; human; aspartyl protease; diagnosis; neoplasia; tumour; 

KW breast tumour; colon tumour. 

XX 

OS Homo sapiens . 
XX 

PN W09933963-A1. 
XX 

PD 08-JUL-1999. 
XX 

PF 14-DEC-1998; 98WO-US026547 . 
XX 

PR 31-DEC-1997; 97US-0070112P . 
XX 

PA (CHIR ) CHIRON CORP. 
XX 

PI Giese KW, Xin H; 
XX 

DR WPI; 1999-430240/36. 
DR N-PSDB; AAX89297. 
XX 

PT Human CSP56 protein for diagnosis of neoplasia. 
XX 

PS Claim 2; Fig 2A; 51pp; English. 
XX 

CC This represents a human CSP56 protein, a novel aspartyl protease. The 

CC CSP56 protein can be used in methods for diagnosing neoplasia, for 

CC determining the metastatic potential of a tumour, and for screening test 

CC compounds for the ability to suppress the metastatic potential of a 

CC tumour. The tumours are preferably from breast or colon 

XX 

SQ Sequence 518 AA; 



Query Match 

Best Local Similarity 



100.0%; 
100.0%; 



Score 2687; DB 2; Length 518; 
Pred. No. 8.6e-231; 



Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARAL LL P L LAQWL L RAAP E LAP AP FT L P L RVAAATN R WAP T P G P GT P AE RHAD GL 60 

M I I I I I II I I I II I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I || || | | | | I I I I | 
Db 1 MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

Qy 61 ALALE P ALAS PAGAAN FLAMVDNLQ GD S GRG Y YLEMLI GT P PQKLQ I LVDTG S SN FAVAG 12 0 

I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | 
Db 61 ALALEPALAS PAGAAN FLAMVDNLQGDSGRGYYLEML I GTP PQKLQ I LVDTGSSN FAVAG 12 0 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I M I I I I M I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I M I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I M I I I M I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVTDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I | 
Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENS S RS FRI T I LPQL YI QPMMGAGLN YEC YRFGI S P STNALVI GATVMEGFYVI FD 420 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 
Db 361 YLRDENS SRS FRI TILPQLYIQPMMGAGLNYECYRFGISPSTNALVI GATVMEGFYVI FD 420 

Qy 421 RAQKRVG FAAS P CAE I AGAAVS EISGPFSTE D VAS N CVP AQ S L S EP I LW I VS YALMS VC G 480 

I I M I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 RAQKRVG FAAS P CAE I AGAAVS EISGPFSTE D VAS N CVP AQ S L S E P I LW I VS YALMS VC G 480 

Qy 481 AI LLVLI VLLLLPFRCQRRPRDPEWNDES SLVRHRWK 518 

I I I I I I I I I I I I II I I I I I I I I I I I I II I II I I I I I I I 
Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDES SLVRHRWK 518 



RESULT 3 


AAY22239 


ID 


AAY22239 standard; protein; 518 AA. 


XX 




AC 


AAY22239; 


XX 




DT 


20-SEP-1999 (first entry) 


XX 




DE 


Human CSP56, aspartyl-type protease, protein sequence. 


XX 




KW 


Metastatic marker protein; human; cancer metastasis; breast cancer; 


KW 


colon cancer; diagnosis; therapy; tumour; metastatic potential; CSP56; 


KW 


aspartyl-type protease. 


XX 




OS 


Homo sapiens. 


XX 




PN 


WO9934004-A2. 


XX 





PD 08-JUL-1999 . 
XX 

PF 24-DEC-1998; 98WO-US027608 . 
XX 

PR 31-DEC-1997; 97US-007 0112P . 
XX 

PA (CHIR ) CHIRON CORP. 
XX 

PI Xin H, Giese K; 
XX 

DR WPI; 1999-430248/36. 

DR N-PSDB; AAX84708. 
XX 

PT New polynucleotides associated with cancer metastasis. 
XX 

PS Claim 4; Page 78-80; 80pp; English. 
XX 

CC This sequence represents a polypeptide of the invention, and is an 

CC aspartyl-type protease, designated CSP56. The polynucleotides (PNs) of 

CC the invention encode metastatic marker protein variants. The PNs and 

CC polypeptides can be used as markers for cancer metastasis. The products 

CC can be used for identifying metastatic tissue or metastatic potential of 

CC a tissue, e.g. breast or colon tissue. They can also be used for 

CC screening test compounds for the ability to suppress the metastatic 

CC potential of a tumour. The products can be used for developing products 

CC for the therapy of cancers, particularly breast or colon cancer 

XX 

SQ Sequence 518 AA; 



Query Match 100.0%; Score 2687; DB 2; Length 518; 

Best Local Similarity 100.0%; Pred. No. 8.6e-231; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MG ALARAL L L P L LAQW L L RAAP E LAP AP FT L P L RVAAATN RWAP T P G P GT P AE RHAD GL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 1 MG ALARAL L L P L LAQW L L RAAP E LAP AP FT L P L RVAAATN RWAP T P G P GT P AE RHAD G L 60 

Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I 
Db 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 



Qy 181 FESENFFLPGI KWN G I L G LAYAT LAK P S S S LET FFD S LVTQAN I PNVFS MQMCGAGL P VA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 

Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 



Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 



Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 



Qy 



361 YLRDENS SRS FRITI LPQLYIQPMMGAGLNYECYRFGI S PSTNALVI GATVMEGFYVI FD 420 




Db 



361 YLRDENS SRS FRIT I LPQLYIQPMMGAGLNYECYRFGISPSTNALVI GATVMEGFYVI FD 420 



QY 



421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE DVASNCVP AQ S L S E P I LW I VS YALMS VC G 480 




Db 



421 RAQ KRVG FAAS P CAE I AGAAVS E I S G P F S T EDVAS NC VP AQ S L S E P I LW I VS YALMS VC G 480 



Qy 



481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 




Db 



481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 



RESULT 4 
AAY41714 

ID AAY41714 standard; protein; 518 AA. 
XX 

AC AAY41714; 
XX 

DT 07-DEC-1999 (first entry) 
XX 

DE Human PR0852 protein sequence. 
XX 

KW Human; PRO; EST; expressed sequence tag; PCR primer; hybridisation; 

KW probe; blood coagulation disorder; cancer; cellular adhesion disorder; 

KW secreted protein; transmembrane protein. 



OS 


Homo sapiens . 




XX 












PN 


W09946281-A2. 




XX 












PD 


16 


-SEP- 


1999 






XX 












PF 


08 


-MAR- 


1999; 99WO- 


US005028. 


XX 












PR 


10 


-MAR- 


1998, 


; 98US- 


0077450P. 


PR 


11 


-MAR- 


1998, 


; 98US- 


0077632P. 


PR 


11 


-MAR- 


1998, 


r 98US- 


0077641P. 


PR 


11 


-MAR- 


1998, 


? 98US- 


0077649P. 


PR 


12 


-MAR- 


1998, 


; 98US- 


0077791P. 


PR 


13 


-MAR- 


1998, 


; 98US- 


0078004P. 


PR 


17 


-MAR- 


1998, 


; 98US- 


00040220. 


PR 


20 


-MAR- 


1998, 


; 98US- 


0078886P. 


PR 


20 


-MAR- 


1998, 


; 98US- 


0078910P. 


PR 


20 


-MAR- 


1998, 


: 98US- 


0078936P. 


PR 


20- 


-MAR- 


1998, 


• 98US- 


0078939P. 


PR 


25- 


-MAR- 


1998, 


; 98US- 


0079294P. 


PR 


26 


-MAR- 


1998, 


: 98US- 


0079656P. 


PR 


27 


-MAR- 


1998, 


: 98US- 


0079663P. 


PR 


27- 


-MAR- 


1998, 


r 98US- 


0079664P. 


PR 


27- 


-MAR- 


1998, 


; 98US- 


0079689P. 


PR 


27- 


-MAR- 


1998, 


r 98US- 


0079728P. 


PR 


27 


-MAR- 


1998, 


r 98US- 


0079786P. 


PR 


30- 


-MAR- 


1998, 


; 98US- 


0079920P. 


PR 


30- 


-M7VR- 


1998, 


r 98US- 


0079923P. 


PR 


31- 


-MAR- 


1998, 


; 98US- 


0080105P. 


PR 


31- 


-MAR- 


1998, 


: 98US- 


0080107P. 



XX 



PR 


31 


-MAR~ 


1998 


; 98US- 


-0080165P. 


PR 


31 


-MAR- 


-1998 


98US- 


-0080194P. 


PR 


01 


-APR- 


-1998 


98US- 


-0080327P. 


PR 


01 


-APR- 


-1998 


; 98US- 


-0080328P. 


PR 


01 


-APR- 


-1998 


98US- 


-0080333P. 


PR 


01 


-APR- 


-1998 


; 98US- 


-0080334P. 


PR 


08 


-APR- 


-1998 


; 98US- 


-0081049P. 


PR 


08 


-APR- 


-1998 


; 98US- 


-0081070P. 


PR 


08 


-APR- 


-1998 


; 98US- 


-0081071P. 

XV \J XV XV F ^ V 


PR 


09 


-APR- 


-1998 


; 98US- 


-0081195P. 

W W W X — vV X— • 


PR 


09- 


-APR- 


-1998 


; 98US- 


-0081203P. 

XV XV XV \/ ^V X. • 


PR 


09- 


-APR- 


-1998 


9 8US- 


-0081229P. 

XV XV XV L< X> • 


PR 


15 


-APR- 


-1998 


; 98US- 


-0081817P. 

U W W _L_ vV — 1— t X_ t 


PR 

X, > 


15 


-APR- 

X 1^ X x 


-1998 


98US- 


-0081838P 

w w VV _1_ XV U J_ « 


PR 


15 


-APR- 

X 14< X X 


■1998 

XV 


98US- 

r XV XV KV 


-008 1952P 

XV XV \J _X> ^-V -^V L^a X- • 


PR 


15- 


-APR- 


-1998 


r 98US- 


-0081955P 

XV XV XV — ' »—/ X. • 


PR 


21- 


-APR- 


-1998, 


; 98US- 


-0082568P. 

XV XV XV 4_j v-^ XV XV X- * 


PR 


21- 


-APR- 


-1998, 

wV V./ 


; 98US- 

' m*f XV XV hV 


-0082569P 

XV XV XV b 'w' XV —V X. » 


PR 

X. X X 


22 


-APR- 

X Li. X X 


-1998 

-X, _V _V ! 


• 98US- 


-0082700P 

v v v L / xv xv i_ * 


PR 

^ x \ 


22- 


-APR- 


•1998, 

_1_ XV 


r 98US- 


•0082704P 


PR 


22- 


-APR- 

X LL. X X 


-1998 , 

_X* — V wV xv 


: 98US- 

p V V >hV 


-0082804P 

XV XV XV \J XV ^ J_ * 


PR 


23- 


-APR- 


■1998, 


r 98US- 

' ' XV XV 


-0082767P 

XV XV XV / xV / X, * 


PR 


23- 


-APR- 


-1998, 

^V XV j 


r 98US- 

r k/ XV XV kV 


-0082796P 

XV XV XV I ^V W i_ t 


PR 

x. x \ 


27- 


-APR- 


1998 , 

-1- -V *V XV j 


• 98US- 

t — / XV X^ *wJ 


0083336P 

xv xv \J uf w x. * 


PR 


28- 


-APR- 


1998, 

_l_ —V _V XV j 


r 98US- 

r -v \j Vv tw' 


■0083322P 


PR 

X* A \ 


29- 


-APR- 


1998 , 

—V ^ XV J 


: 98US- 

f _/ XV XV I^V 


■0083392P 

xv vy xv x-/ »J v f_i x_ • 


PR 


29- 


-APR- 


1998, 


; 98US- 

w/ XV XV h/ 


0083495P 

XV W XV i — V X. » 


PR 


29- 


-APR- 


1998, 


; 98US- 

" XV XV LV 


•0083496P 

XV XV XV X _V Vv i_ * 


PR 


29- 


-APR- 


1998, 


; 98US- 

f —J W vV b^V 


•0083499P 

VV XV VV -*J ^1 _v V X, « 


PR 

x_ x \ 


29- 


-APR- 


1998 . 

_L_ — V _V Vv j 


* 98US- 

r -V XV \J 


0083500P 

V V U J -s-F* XV XV J_ • 


PR 


29- 


-APR- 


1998, 


r 98US- 

■ —S XV XV tuf 


0083545P 

VV XV XV ^/ ^— ' X ^/ X. * 


PR 


29- 


-APR- 


1998; 


f 98US- 

' XV XV kV 


0083554P 

XV XV XV %V ^_V x. * 


PR 


29- 


-APR- 


1998; 


f 98US- 

' ^-^ XV XV 


0083558P. 

XV XV XV ^V w x. t 


PR 


29- 


-APR- 

X X~ X X 


1998 ; 

-1_ -S XV j 


98US- 

VV 


0083559P 

xv XV Vv ^ «— / V x* » 


PR 

X» X \ 


30- 


-APR- 

X Ll> X x 


1998 j 


98US- 


0083742P 

XV XV W / X Lj J_ » 


PR 


05- 


-MAY- 


1998 ; 

J_ w' XV J 


98US- 

XV XV KV 


0084366P. 

XV XV W X -w' XV Xv X. • 


PR 


06- 


-MAY- 


1998; 


98US- 


0084414P. 

W XV W X X —L, X X> » 


PR 


06- 


-MAY- 


1998; 


98US- 


0084441P. 

W XV XV X X X 1 J— * 


PR 


07- 


-MAY- 

x u. x ^ 


1998; 

XV J 


98US- 

*V XV XV KV 


0084598P 

XV W XV X v Vv X_ • 


PR 


07- 


-MAY- 


1998; 


98US- 

f XV XV LV 


0084600P 

Vv W VV X v xv Xv X. » 


PR 


07- 


-MAY- 


1998 ; 


98US- 


0084627P. 

XV W VV A W C_* I X_ * 


PR 


07- 


-MAY- 


1998; 


98US- 


0084637P. 


PR 


07- 


-MAY- 


1998; 


98US- 


0084639P. 


PR 


07- 


~MAY- 


1998 ; 


98US- 


0084640P. 

w XV XV X XV X XV X. * 


PR 


07- 


-MAY- 


1998; 


98US- 


0084643P. 


PR 


13- 


-MAY- 


1998 ; 


98US- 


0085323P. 


PR 


13- 


-MAY- 


1998; 


98US- 


0085338P. 


PR 


13- 


-MAY- 


1998; 


98US- 


0085339P. 


PR 


15- 


-MAY- 


1998; 


98US- 


0085573P. 


PR 


15- 


-MAY- 


1998 ; 


98US- 

*v Vv Xv 1 ^ 


0085579P. 

\v xv xv v r ^ X* 4 


PR 


15- 


-MAY- 


1998; 


98US- 


0085580P. 


PR 


15- 


-MAY- 


1998; 


98US- 


0085582P. 


PR 


15- 


-MAY- 


1998; 


98US- 


0085689P. 


PR 


15- 


-MAY- 


1998; 


98US- 


0085697P. 


PR 


15- 


-MAY- 


1998; 


98US- 


0085700P. 


PR 


15- 


-MAY- 


1998; 


98US- 


0085704P. 


PR 


18- 


-MAY- 


1998; 


98US- 


0086023P. 



PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
XX 
PA 
XX 
PI 
XX 
DR 
DR 
XX 
PT 
PT 
PT 
XX 
PS 
XX 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
XX 
SQ 



22 


-MAY- 


1998, 


; 98US- 


0086392P. 




-MAY- 


X _7 Z? (J t 


r JUUO 


OOR 64 1 4 P 


22 


-MAY- 


1998, 


; 98US- 


0086430P. 


22 


-MAY- 


1998, 


; 98US- 


0086486P. 


28 


-MAY- 


1998, 


; 98US- 


0087098P. 


28 


-MAY- 


1998, 


r 98US- 


0087106P. 


28 


-MAY- 


1998, 


; 98US- 


0087208P. 


30 


-JUL- 


1998, 


; 98US- 


0094651P. 


11 


-SEP- 


1998, 


; 98US- 


0100038P. 



(GETH ) GENENTECH INC. 

Wood WI, Goddard A, Gurney A, Yuan J, Baker KP, Chen J; 

WPI; 1999-551358/46. 
N-PSDB; AAZ34056. 

New secreted and transmembrane polypeptides and their polynucleotides, 
useful for treating blood coagulation disorders, cancers and cellular 
adhesion disorders . 

Claim 12 ; Fig 73 ; 530pp ; English . 

The present invention describes secreted and transmembrane polypeptides 
and their polynucleotides. The nucleotide sequences are useful as sources 
of probes, primers, for chromosome mapping, and for generation of 
antisense sequences. They can also be used to create transgenic animals. 
The proteins can be used to treat a variety of diseases and disorders, 
depending on their function. Diseases that may be treated include blood 
coagulation disorders, cancers and cellular adhesion disorders. They may 
also be used to raise antibodies. AAZ33891 to AAZ34338, and AAY41685 to 
AAY41774 represent polynucleotide and polypeptide sequence given in the 
exemplification of the present invention 



Sequence 518 AA; 



Query Match 100 
Best Local Similarity 10 0 
Matches 518 ; Conservative 



0%; Score 2687; DB 2; Length 518; 
0%; Pred. No. 8.6e-231; 

0; Mismatches 0; Indels 0; 



Gaps 



0; 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



1 MGALARAL L L P L LAQ W L L RAAP E LAP AP FT L P L RVAAATN RWAP T P G P GT P AE RHAD G L 60 

I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1 MG ALARAL L L P L LAQWL L RAAP E LAP AP FT L P L RVAAATN RWAPT P G P GT P AERHAD GL 60 

61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 12 0 

121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 

181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 



Qy 



241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 




Db 



241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 



QY 



301 IVDSGTTLLRLPQKVFDAWEAV7VRASLIPEESDGFWTGSQLACWTNSETPWSYFPKISI 360 




Db 



301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 



QY 



361 YLRDENS S RS FRI T I LPQL YI QPMMGAGLN YECYRFGI S P STNALVT GATVMEGF YVI FD 420 




Db 



361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 



QY 



Db 




Qy 



481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 




Db 



481 AI LLVLI VLLLLPFRCQRRPRDPEWNDES S LVRHRWK 518 



RESULT 5 
AAY88424 

ID AAY88424 standard; protein; 518 AA. 
XX 

AC AAY88424; 
XX 

DT 03-AUG-2000 (first entry) 
XX 

DE Human aspartyl protease 1 (Aspl) amino acid sequence. 
XX 

KW Aspartyl protease; aspartase; amyloid precursor protein; APP; Asp 1; 

KW Alzheimer's disease; beta secretase site. 

XX 

OS Homo sapiens. 
XX 

PN WO200017369-A2. 
XX 

PD 30-MAR-2000. 
XX 

PF 23-SEP-1999; 99WO-US020881 . 
XX 

PR 24-SEP-1998; 98US-0101594P . 
XX 

PA (PHAA ) PHARMACIA & UPJOHN CO. 
XX 

PI Gurney ME, Bienkowski MJ, Heinrikson RL, Parodi LA, Yan R; 
XX 

DR WPI; 2000-303209/26. 

DR N-PSDB; AAA15661. 
XX 

PT New enzyme designated human aspartase useful in research into Alzheimer's 

PT Disease is capable of cleaving amyloid protein precursor at the beta 

PT secretase site to produce amyloid beta peptide. 
XX 

PS Claim 54; Fig 1; 183pp; English. 
XX 

CC This sequence represents the human aspartyl protease amino acid sequence. 



CC The invention relates to a protease capable of cleaving the beta 

CC secretase site of amyloid precursor protein (APP) . The protease contains 

CC a sequence encoding the amino acid sequence DTG and a sequence encoding 

CC DSG or DTG separated by 100-300 amino acids. When mutated the APP gene 

CC causes an autosomal dominant form of Alzheimer's disease. APP localises 

CC to the cell surface membrane and have a single C-terminal transmembrane 

CC domain. Proteolytic processing of APP produces the amyloid beta protein, 

CC which is possibly very important in Alzheimer's disease. The invention 

CC includes a nucleotide sequence encoding the protease, a vector containing 

CC the nucleotide sequence, and a cell line comprising the vector. Methods 

CC for screening for inhibitors of beta secretase activity are also given in 

CC the invention. The human aspartase protein and nucleotide sequences and 

CC the methods for identifying inhibitors of the protease, are useful in the 

CC treatment of and research in to Alzheimer's disease 
XX 

SQ Sequence 518 AA; 

Query Match 100.0%; Score 2687; DB 3; Length 518; 

Best Local Similarity 100.0%; Pred. No. 8.6e-231; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARALLLPLIAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

I I I I I I I I I II I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | 
Db 1 MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVT>TGSSNFAVAG 12 0 

I I I I I I I M I I M I I I I I I I I I I I II II I I I I I I I II I M I I I I I I I I II I I I I I I I I I I 
Db 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FE S EN FFL P G I KWNG I L GLAYAT LAK PSSSLETFFDS LVT QAN I PN VFSMQMC GAGL P VA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVT)SGTTLLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 

Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVTFD 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

Qy 421 RAQKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPILWIVSYALMSVCG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 RAQ KRVGFAAS PCAE I AGAAVS EISGPFSTE DVAS N C VP AQ S L S E P I LW I VS YALMS VC G 480 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 



RESULT 6 
AAB44270 
ID 
XX 



AC 
XX 
DT 
XX 
DE 
XX 
KW 
KW 
XX 
OS 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
XX 
PA 
XX 
PI 
PI 
PI 
PI 
PI 
XX 
DR 
DR 
XX 
PT 
PT 
PT 
XX 
PS 



AAB44270 standard; protein; 518 AA. 
AAB44270; 

08-FEB-2001 (first entry) 

Human PR0852 (UNQ418) protein sequence SEQ ID NO: 196. 

Human; secreted protein; transmembrane protein; PRO; EST; cytostatic; 
expressed sequence tag; detection; cancer. 

Homo sapiens . 

WO200053756-A2 . 



14 


-SEP- 


2000. 








18 


-FEB- 


2000; 


2 


000WO- 


-US004341. 


08 


-MAR- 


1999; 




99WO- 


-US005028. 


12 


-MAR- 


1999; 




99US- 


-0123957P. 


29 


-MAR- 


1999; 




99US- 


-0126773P. 


21 


-APR- 


1999; 




99US- 


-0130232P. 


28 


-APR- 


1999; 




99US- 


-0131445P. 


14 


-MAY- 


1999; 




99US- 


-0134287P. 


23 


-JUN- 


1999; 




99US- 


-0141037P. 


26 


-JUL- 


1999; 




99US- 


-0145698P. 


29 


-OCT- 


1999; 




99US- 


-0162506P. 


30 


-NOV- 


1999; 




9 9WO- 


-US028313. 


02 


-DEC- 


1999; 




9 9WO 


-US028551. 


02 


-DEC- 


1999; 




99WO- 


-US028565. 


16 


-DEC- 


1999; 




99WO- 


-US030095. 


30 


-DEC- 


1999; 




99WO- 


-US031243. 


30 


-DEC- 


1999; 




99WO- 


-US031274. 


05 


-JAN- 


2000; 


2 


000WO- 


-US000219. 


06 


- JAN- 


2000; 


2 


000WO- 


-US000277. 


06 


-JAN- 


2000; 


2 


000WO- 


-US000376. 



(GETH ) GENENTECH INC. 

Ashkenazi AJ, Baker KP, 
Ferrara N, Filvaroff E, 



Botstein D, Desnoyers L, Eaton DL; 
Fong S, Gao W, Gerber H, Gerritsen 

Hillan KJ; 



ME; 

Roy MA, Shelton DL; 



Goddard A f Godowski PJ, Grimaldi CJ, Gurney AL f 
Kljavin IJ, Kuo SS, Napier MA, Pan J, Paoni NF f 
Stewart TA, Tumas D, Williams PM, Wood WI; 

WPI; 2000-611443/58. 
N-PSDB; AAC78500. 



Novel PRO polypeptides and polynucleotides used in detection methods, to 
target bioactive molecules to specific cells, and to modulate cellular 
activities . 



Claim 12; Fig 73; 636pp; English. 



XX 

CC AAC78458 to AAC78599 represent polynucleotide and EST (expressed sequence 

CC tag) sequences which encode secreted or transmembrane PRO polypeptides. 

CC The PRO polynucleotides and polypeptides have cytostatic activity. The 

CC polynucleotides and polypeptides can be used for detecting the presence 

CC of PRO polypeptides in samples, for linking bioactive molecules to cells 

CC and for modulating biological activities of cells, using the polypeptides 

CC for specific targeting. The polypeptide targeting can be used to kill the 

CC target cells, e.g. for the treatment of cancers. The polypeptide pairs 

CC provide specific targeting of bioactive molecules to cells. AAC78600 to 

CC AAC78987 represent PCR primers and probes used in the isolation of the 

CC PRO polynucleotide sequences 
XX 

SQ Sequence 518 AA; 

Query Match 100.0%; Score 2687; DB 3; Length 518; 

Best Local Similarity 100.0%; Pred. No. 8.6e-231; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MG ALARAL LL P L LAQWL L RAAP E LAP AP FT L P L RVAAAT N RWAP T P G P GT P AE RHAD G L 60 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 1 MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 12 0 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I II I II I I I I I I I I I I 

Db 61 AIJVLEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 12 0 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FE S EN FFL P G I KWNGI LGLAYAT LAK PSSSLETFFDS LVTQAN I PNVFSMQMCGAGL P VA 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

Db 181 FES ENFFLPGI KWNGI LGLAYATLAKPSSSLETFFDSLVTQANI PNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVXGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENSSRS FRITILPQLYIQPMMGAGLNYECYRFGI S PSTNALVT GATVMEGFYVI FD 420 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

Db 361 YLRDENS S RS FRI TI LPQLYIQPMMGAGLN YEC YRFGI S P STNALVI GATVMEGFYVI FD 420 

Qy 421 RAQKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPILWIVSYALMSVCG 480 

II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

Db 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS N C VP AQ S L S E P I LW I VS YALMS VC G 480 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 



RESULT 7 



AAU07201 

ID AAU07201 standard; protein; 518 AA. 
XX 

AC AAU07201; 
XX 

DT 24-OCT-2001 (first entry) 
XX 

DE Human aspartyl protease 1 (Asp-1) . 
XX 

KW Human; aspartyl protease 1; Asp-1; nootropic; neuroprotective; 

KW aspartyl protease 2; Asp2 ; amyloid protein precursor; APP; 

KW beta-secretase; Alzheimer's disease. 
XX 

OS Homo sapiens. 
XX 

PN WO200149097-A2 . 
XX 

PD 12-JUL-2001. 
XX 

PF 09-MAY-2001; 2001WO-IB0007 97 . 
XX 

PR 09-MAY-2001; 2001WO-IB00 07 97 . 
XX 

PA (BIEN/) BIENKOWSKI M J. 

PA (GURN/) GURNEY M E. 

PA (HEIN/) HEINRIKSON R L. 

PA (PARO/) PARODI L A. 

PA (YANR/) YAN R. 

XX 

PI Bienkowski MJ, Gurney ME, Heinrikson RL, Parodi LA, Yan R; 
XX 

DR WPI; 2001-502548/55. 

DR N-PSDB; AAS11701. 
XX 

PT Novel purified polypeptide comprising fragment of mammalian aspartyl 

PT protease 2, lacking Asp2 transmembrane domain and retaining beta 

PT secretase activity of Asp2 useful for identifying inhibitors of Asp2 

PT activity. 

XX 

PS Example 2; Fig 1; 185pp; English. 
XX 

CC The invention relates to a novel purified polypeptide comprising a 

CC fragment of mammalian aspartyl protease 2 (Asp2) protein which lacks the 

CC Asp2 transmembrane domain and the Asp2 protein, and where the polypeptide 

CC and the fragment retain the beta-secretase activity of the mammalian Asp2 

CC protein. Also included is an isoform of amyloid protein precursor (APP) 

CC comprising the amino acid sequence of a APP or its fragment containing an 

CC APP cleavage site recognisable by a mammalian beta-secretase, and further 

CC comprising two lysine residues at the carboxyl terminus of the amino acid 

CC sequence of the mammalian APP or APP fragment. The polypeptides are used 

CC for assaying for modulators of beta-secretase activity; identifying 

CC agents that inhibit the APP processing activity of human Asp2 aspartyl 

CC protease (Hu-Asp2); identifying agents that modulate the activity of Asp2 

CC ; and for reducing cellular production of amyloid beta (Abeta) from APP. 

CC Agents identified by the above methods are useful for treating 

CC Alzheimer's disease; and for identifying modulators of amyloid-beta 

CC (Abeta) peptide production, for use in designing therapeutics for the 



CC treatment or prevention of Alzheimer's disease. Probes and primers 

CC derived from Asp nucleic acid sequences are useful for detecting Hu-Asp 

CC nucleic acids in in vitro assays and in Northern and Southern blots. The 

CC present sequence represents the amino acid sequence of human Asp-1 

XX 

SQ Sequence 518 AA; 

Query Match 100.0%; Score 2687; DB 4; Length 518; 

Best Local Similarity 100.0%; Pred. No. 8.6e-231; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGAIAJ^ALLLPLIAQWLLRAAPEIAPAPFTLPLR 60 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MGA1ARALLLPLIAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

I I I I I I I I I I II I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I 

Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 181 FESENFFLPGIKWNGILGLAYATIAKPSSSLETFFDSLWQANIPNVFSMQMCGAGLPVA 24 0 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 

Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 11 I I I I I I I 

Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 361 YLRDENS SRS FRITI LPQLYIQPMMGAGLNYECYRFGI S PSTNALVI GATVMEGFYVI FD 420 

Qy 421 RAQ K RVG FAAS P CAE I AG AAVS EISGPFSTE D VAS N C VP AQ S L S E P I L W I VS YALM S VC G 480 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I M I I I I I I I I I I I I II I I I I I I I I I I I I I I 

Db 421 RAQ K RVG FAAS P CAE I AGAAVS EISGPFSTE D VAS N C VP AQ S L S E P I LW I VS YALMS VC G 480 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 4 81 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

RESULT 8 
AAE10628 

ID AAE10628 standard; protein; 518 AA. 
XX 

AC AAE10628; 
XX 

DT 10-DEC-2001 (first entry) 
XX 

DE Human aspartyl protease 1 (hu-Aspl) protein. 



XX 

KW Human; aspartyl protease 1; Aspl; amyloid precursor protein; APP; 

KW Alzheimer's disease; AD; dementia; neurofibrillary tangle; gliosis; 

KW amyloid plaque; neuronal loss; proteolytic; nootropic; neuroprotective; 

KW chromosome 21. 

XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualif iers 

FT Peptide 1. .20 

FT /label= Signal__peptide 

FT Protein 21. .518 

FT /note= "Mature human aspartyl protease 1" 

FT Domain 4 69. .492 

FT /label= Transmembrane_domain 

XX 

PN GB2357767-A. 
XX 

PD 04-JUL-2001. 
XX 

PF 22-SEP-2000; 2000GB-00023315 . 
XX 

PR 23-SEP-1999; 99US-004 04 133 . 

PR 23-SEP-1999; 99US-01554 93P . 

PR 23-SEP-1999; 99WO-US0208 81 . 

PR 13-OCT-1999; 99US-004 16901 . 

PR 06-DEC-1999; 99US-01692 32P . 
XX 

PA (PHAA ) PHARMACIA & UPJOHN CO. 
XX 

PI Bienkowkski MJ, Gurney M; 
XX 

DR WPI; 2001-444208/48. 

DR N-PSDB; AAD17864. 
XX 

PT Polypeptide comprising fragments of human aspartyl protease with amyloid 

PT precursor protein processing activity and alpha-secretase activity, for 

PT identifying modulators useful in treating Alzheimer's disease. 
XX 

PS Claim 36; Fig 1; 187pp; English. 
XX 

CC The patent discloses human aspartyl protease 1 (hu-Aspl) or modified Aspl 

CC proteins which lack transmembrane domain or amino terminal domain or 

CC cytoplasmic domain and retains alpha-secretase activity and amyloid 

CC protein precursor (APP) processing activity. The proteins of the 

CC invention are useful for assaying hu-Aspl alpha-secretase activity, which 

CC in turn is useful for identifying modulators of hu-Aspl alpha-secretase 

CC activity, where modulators that increase hu-Aspl alpha-secretase activity 

CC are useful for treating Alzheimer's disease (AD) which causes progressive 

CC dementia with consequent formation of amyloid plaques, neurofibrillary 

CC tangles, gliosis and neuronal loss. Hu-Aspl protease substrate is useful 

CC for assaying hu-Aspl proteolytic activity, by contacting hu-Aspl protein 

CC with the substrate under acidic conditions and determining the level of 

CC hu-Aspl proteolytic activity. The present sequence is Aspl protein from 

CC human. Aspl gene is localised on chromosome 21 
XX 

SQ Sequence 518 AA; 



Query Match 100.0%; Score 2687; DB 4; Length 518; 

Best Local Similarity 100.0%; Pred. No. 8.6e-231; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARAL LLP LLAQWL L RAAP E LAPAP FT L P L RVAAATNR VVAPT PGP GT P AE RHAD GL 60 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MGALARALLLPLI^QWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 

Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I II I i 

Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 IVT3SGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENS S RS FRI TI LPQLYIQ PMMGAGLNYECYRFGI S PSTNALVI GATVMEGFYVI FD 420 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 YLRDENS S RS FRI TI LPQLYI QPMMGAGLN YECYRFGI S PSTNALVI GATVMEGFYVI FD 420 

Qy 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE D VAS N C VP AQ S L S E P I LW I VS YALMS VC G 480 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I 
Db 421 RAQKRVG FAAS P CAE I AGAAVS EISGPFSTE D VAS N CVP AQ S L S E P I LW I VS YALMS VC G 480 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 

Db 481 AILLVLIVliLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

RESULT 9 
AAE10656 

ID AAE10656 standard; protein; 518 AA. 
XX 

AC AAE10656; 
XX 

DT 10-DEC-2001 (first entry) 
XX 

DE Human-Asp 1 protein lacking TM domain and containing (His) 6 tag. 
XX 

KW Human; aspartyl protease 1; Aspl; amyloid precursor protein; APP; 

KW Alzheimer's disease; AD; dementia; neurofibrillary tangle; gliosis; 

KW amyloid plaque; neuronal loss; proteolytic; nootropic; neuroprotective. 

XX 

OS Homo sapiens . 



OS Synthetic. 
XX 

PN GB2357767-A. 
XX 

PD 04-JUL-2001. 
XX 

PF 22-SEP-2000; 2000GB-00023315 . 
XX 

PR 23-SEP-1999 



PR 23-SEP-1999 

PR 23-SEP-1999 

PR 13-OCT-1999 

PR 06-DEC-1999 



99US-00404133. 
99US-0155493P. 
99WO-US020881 . 
99US-00416901. 
99US-0169232P. 
XX 

PA (PHAA ) PHARMACIA & UPJOHN CO. 
XX 

PI Bienkowkski MJ, Gurney M; 
XX 

DR WPI; 2001-444208/48. 
XX 

PT Polypeptide comprising fragments of human aspartyl protease with amyloid 

PT precursor protein processing activity and alpha-secretase activity, for 

PT identifying modulators useful in treating Alzheimer's disease. 
XX 

PS Example 14; Page 155-156; 187pp; English. 
XX 

CC The patent discloses human aspartyl protease 1 (hu-Aspl) or modified Aspl 

CC proteins which lack transmembrane domain or amino terminal domain or 

CC cytoplasmic domain and retains alpha-secretase activity and amyloid 

CC protein precursor (APP) processing activity. The proteins of the 

CC invention are useful for assaying hu-Aspl alpha-secretase activity, which 

CC in turn is useful for identifying modulators of hu-Aspl alpha-secretase 

CC activity, where modulators that increase hu-Aspl alpha-secretase activity 

CC are useful for treating Alzheimer's disease (AD) which causes progressive 

CC dementia with consequent formation of amyloid plaques, neurofibrillary 

CC tangles, gliosis and neuronal loss. Hu-Aspl protease substrate is useful 

CC for assaying hu-Aspl proteolytic activity, by contacting hu-Aspl protein 

CC with the substrate under acidic conditions and determining the level of 

CC hu-Aspl proteolytic activity. The present sequence is human Asp 1 protein 

CC lacking a transmembrane (TM) domain and containing (His) 6 tag. This 

CC sequence is generated from human Asp 1 protein by the deletion of its C- 

CC terminal TM domain and addition of hexa-histidine tag at its C-terminus 
XX 

SQ Sequence 518 AA; 

Query Match 100.0%; Score 2687; DB 4; Length 518; 

Best Local Similarity 100.0%; Pred. No. 8.6e-231; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARALLL P LLAQWLL RAAP E LAP AP FT L P L RVAAATN RWAP T P G P GT P AERHAD G L 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MGALARAL L L P L LAQ W L L RAAP E LAP AP FT L P L RVAAAT N RWAP T P G P GT P AE RHAD G L 60 

Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

Db 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLI GT PPQKLQI LVDTGS SNFAVAG 120 



Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I 1 M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I > I I I I I I I I I I I I I I I I I 

Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGIAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II M I I M 

Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

Qy 421 RAQ K R VG FAAS P CAE I AG AAVS EISGPFSTE D VA S N C V P AQ S L S E P I L W I VS Y ALM S VC G 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 RAQ K RVG FAAS P CAE I AGAAVS EISGPFSTE D VAS N C VP AQ S L S E P I LW I VS Y ALMS VC G 480 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 



RESULT 10 
AAE06858 

ID AAE06858 standard; protein; 518 AA, 
XX 
AC 
XX 
DT 
XX 
DE 
XX 
KW 
KW 
KW 
KW 
XX 
OS 
XX 
FH 
FT 
FT 
FT 
FT 
FT 
FT 
XX 
PN 
XX 
PD 



AAE06858; 

23-OCT-2001 (first entry) 

Human aspartyl protease 1 (Hu-Aspl) protein. 

Human; aspartyl protease 1; Asp 1; beta-amyloid precursor protein; APP; 
beta-secretase; Alzheimer's disease; dementia; amyloid plaque; gliosis; 
neurofibrillary tangle; neuronal loss; amyloid-beta peptide; nootropic; 
neuroprotective; antisense therapy; gene therapy; chromosome 21. 

Homo sapiens . 



Key 

Peptide 
Protein 
Domain 

WO200150829-A2 
19-JUL-2001. 



Location/ Qualifiers 
1. .20 

/label= Signal_peptide 
21. .518 

/note= "Mature human aspartyl protease 1 (Hu-Aspl) " 
469. .492 

/label= Transmembrane domain 



XX 

PF 09-MAY-2001; 2001WO-IB000799 . 
XX 

PR 09-MAY-2001; 2001WO-IB000799 . . 
XX 

PA (BIEN/ ) BIENKOWSKI M J. 

PA (GURN/) GURNEY ME. 

PA (HEIN/) HEINRIKSON R L. 

PA (PARO/) PARODI L A. 

PA (YANR/) YAN R. 

XX 

PI Bienkowski MJ, Gurney ME, Heinrikson RL, Parodi LA, Yan R; 
XX 

DR WPI; 2001-483072/52. 

DR N-PSDB; AAD13020. 
XX 

PT Novel purified polypeptide comprising fragment of mammalian aspartyl 

PT protease 2, lacking Asp2 transmembrane domain and retaining beta 

PT secretase activity of Asp2 useful for identifying inhibitors of Asp2 

PT activity. 

XX 

PS Example 2; Fig 1; 185pp; English. 
XX 

CC The invention relates to human aspartyl proteases (Hu-Asp) , beta-amyloid 

CC precursor protein (APP) isoforms and their corresponding DNA molecules. 

CC Human aspartyl proteases can act as beta-secretase proteases useful for 

CC treating Alzheimer's disease. APP isoforms are useful for identifying 

CC modulators of amyloid-beta peptide production, for use in designing 

CC therapeutics for the treatment and prevention of Alzheimer 1 s disease, 

CC dementia, formation of amyloid plaques, neurofibrillary tangles, gliosis 

CC and neuronal loss. APP isoforms are also used in methods for identifying 

CC inhibitors and modulators of human Asp2 activity. The invention relates 

CC to a method for identifying agents that modulate the activity of human 

CC aspartyl protease Asp2 . Amyloid-beta peptides obtained from APP are used 

CC as a means to screen in cellular assays for the inhibitors of beta- and 

CC gamma- secretase. Hu-Asp DNA fragments are useful as probes or primers in 

CC polymerase chain reactions (PCR) . The probes are useful for detecting Hu- 

CC Asp nucleic acids in in vitro assays and in Northern and Southern blots. 

CC The present sequence is human aspartyl protease 1 (Hu-Aspl) . Hu-Asp 1 

CC gene is localised on chromosome 21 
XX 

SQ Sequence 518 AA; 

Query Match 100.0%; Score 2687; DB 4; Length 518; 

Best Local Similarity 100.0%; Pred. No. 8.6e-231; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARALL L P L LAQWLLRAAP ELAPAP FT L P LRVAAATN RWAP T P G P GT PAE RHAD GL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I 

Db 1 MGALARAL L L P L LAQW L L RAAP E LAP AP FT L P L RVAAAT N RWAP T P G P GT PAE RHAD G L 60 

Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 ALAL E P ALAS P AGAAN FLAMVDN LQ GD S GRG Y Y L EML I GT P P Q KLQ I L VDT G S S N FAVAG 120 



Qy 



121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 18 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I 



Db 



121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 18 0 



Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I M I I I I I I I II I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I II I I I I I I I M I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 42 0 

I 1 I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

Qy 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS N C VPAQ S L S E P I LW I VS YALMS VC G 48 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 RAQKRVGFAASPC7VEI AGAAVS EI SGPFSTEDVASNCVPAQSLSEP I LWIVS YALMS VCG 480 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I I I I I I II I I M I I I I I I I I I I I I I I I I I I I II I 
Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 



RESULT 11 
AAE02608 

ID AAE02608 standard; protein; 518 AA. 
XX 

AC. AAE02608; 
XX 

DT 10-AUG-2001 (first entry) 
XX 

DE Human Aspartyl protease-1 (Asp-1) deltaTM (His) 6 protein. 
XX 

KW Human; alpha-secretase; amyloid precursor protein; APP; therapy; 

KW. Alzheimer's disease; antialzheimer 1 s ; aspartyl protease 1; Aspl; 

KW beta-secretase; Asp-1 deltaTM (His) 6 protein. 
XX 

OS Homo sapiens . 

OS Synthetic. 
XX 

PN WO200123533-A2. 
XX 

PD 05-APR-2001. 
XX 

PF 22-SEP-2000; 2000WO-US026080 . 
XX 

PR 23-SEP-1999; 99US-0155493P . 

PR 23-SEP-1999; 99WO-US020881 . 

PR 13-OCT-1999; 99US-004 16901 . 

PR 06-DEC-1999; 99US-0169232P . 
XX 

PA (PHAA ) PHARMACIA & UPJOHN CO. 
XX 



PI Gurney M, Bienkowski MJ; 
XX 

DR WPI; 2001-290516/30. 
XX 

PT Enzymes that cleave the alpha-secretase site of the amyloid precursor 

PT protein, useful for the treatment of Alzheimer f s disease. 

XX 

PS Example 14; Page 183-184; 189pp; English, 
XX 

CC The present invention relates to enzymes for cleaving the alpha™ 

CC secretase site of the amyloid precursor protein (APP) and methods of 

CC identifying those enzymes. The methods may be used to identify enzymes 

CC that may be used to cleave the alpha-secretase cleavage site of the APP 

CC protein. The enzymes may be used to treat or modulate the progress of 

CC Alzheimer's disease. The present sequence is human Aspartyl protease-1 

CC (Asp-1) deltaTM (His) 6 protein which is used for the expression of pre- 

CC pro-human-Aspartyl protease 1 (Aspl) . This protein is obtained by 

CC replacing C-terminal transmembrane and cytoplasmic domains with a 

CC hexahistidine purification tag in the human Aspartyl protease 1 

XX 

SQ Sequence 518 AA; 

Query Match 100.0%; Score 2687; DB 4; Length 518; 

Best Local Similarity 100.0%; Pred. No. 8.6e-231; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MGALARALLLPLLAQWLLR7VAPEIAPAPFTLPLRVAAATNRWAPTPGPGTPAERHi\DGL 60 

Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I 

Db 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 

Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGI EPSLYKGDIWYT PI KEEWYYQI EI LKLEI GGQSLNLDCREYNADKA 300 

I II I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEI GGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAV7VRASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

I I I I I I II I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 YLRDENS SRS FRITI LPQL YIQPMMGAGLNYECYRFGI S PSTNALVI GATVMEGFYVI FD 420 

Qy 421 RAQKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPILWIVSYALMSVCG 480 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I II 

Db 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS N C VPAQ SLSEPILWIVS YALMS VC G 480 



Qy 481 AILLVLIVLLLLPFRCQRRPRDPEVWDESSLVRHRWK 518 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 



RESULT 12 






AAE02580 






ID 


AAE02580 standard; protein; 518 AA. 


XX 








AC 


AAE02580; 






XX 








DT 


10-AUG-2001 


(first entry) 


XX 








DE 


Human aspartyl 


protease 1 (Asp 1) . 


XX 








KW 


Human; alpha 


-secretase; amyloid precursor prot 


KW 


Alzheimer 1 s 


disease; antialzheimer ! s ; aspartyl 


KW 


bet a- seer etas e 


; chromosome 21. 


XX 








OS 


T-foTno cam f^Ti <=: 


* 




XX 








FH 


Kev 




TiDc^ti nn / Oils 1 i "F i 


FT 


Peoti de 




T 20 


FT 






/label= Signal peptide 


FT 


Pent ide 




22. .62 


FT 






/label= Asp 1 prepropeptide 


FT 


Peptide 




23. .62 


FT 






/label= Asp 1 propeptide 


FT 


Protein 




63. .518 


FT 






/labels Mature human Asp 1 pro 


FT 






/note= "Specifically claimed" 


FT 


Active-site 




87. .89 


FT 






/label= Active site 1 


FT 


Active-site 




110. .113 


FT 






/labels Active site 2 


FT 


Active-site 




303. .305 


FT 






/label= Active site 3 


FT 


Domain 




469. .492 


FT 






/label= Transmembrane domain 


FT 


Domain 




493. .518 


FT 






/label= Cytoplasmic domain 


FT 


Region 




497. .518 


FT 






/note= "Peptide #1" 


XX 








PN 


WO200123533- 


A2 


* 


XX 








PD 


05-APR-2001. 






XX 








PF 


22-SEP-2000; 


2000WO-US026080. 


XX 








PR 


23-SEP-1999; 




99US-0155493P. 


PR 


23-SEP-1999; 




99WO-US020881. 


PR 


13-OCT-1999; 




99US-00416901. 


PR 


06-DEC-1999; 




99US-0169232P. 


XX 








PA 


(PHAA ) PHARMACIA & UPJOHN CO. 



PI Gurney M, Bienkowski MJ; 
XX 

DR WPI; 2001-290516/30. 

DR N-PSDB; AAD06738. 
XX 

PT Enzymes that cleave the alpha-secretase site of the amyloid precursor 

PT protein, useful for the treatment of Alzheimer 1 s disease. 

XX 

PS Claim 29; Fig 1; 189pp; English. 
XX 

CC The present invention relates to enzymes for cleaving the alpha- 

CC secretase site of the amyloid precursor protein (APP) and methods of 

CC identifying those enzymes. The methods may be used to identify enzymes 

CC that may be used to cleave the alpha-secretase cleavage site of the APP 

CC protein. The enzymes may be used to treat or modulate the progress of 

CC Alzheimer's disease. The present sequence is human aspartyl protease 1 

CC (Asp 1) . Asp 1 has alpha-secretase protease and beta-secretase protease 

CC activities. Asp 1 gene is located on chromosome 21 
XX 

SQ Sequence 518 AA; 

Query Match 100.0%; Score 2687; DB 4; Length 518; 

Best Local Similarity 100.0%; Pred. No. 8.6e-231; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGAiARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 1 MGALARALLLPLIAQWLLR7\APELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

I II I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 

Db 61 ALALEPALAS PAG7U\NFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I M I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I II I I 

Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAVVEAV7VRASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 

Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENS S RS FRI T I LPQL YI QPMMGAGLN YEC YRFGI S P STNALVI GATVMEGFYVI FD 420 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 

Db 361 YLRDENS SRS FRIT I LPQLYIQPMMGAGLNYECYRFGISP STNALVI GATVMEGFYVI FD 420 

Qy 421 RAQ KRVG FAAS P CAE I AGAAVS E I SGP FS T EDVAS NCVP AQ S L S E P I LWI VS YALMS VCG 480 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

Db 421 RAQ KRVG FAAS P CAE I AGAAVS E I S G P FS T EDVAS N CVP AQ S L S E P I LW I VS YALMS VC G 480 



Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I I I I I I I I I I I I I I 11 I I I I I I I I I I I I I I II I I 
Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 



RESULT 13 
AAU29059 

ID AAU29059 standard; protein; 518 AA. 
XX 

AC AAU29059; 
XX 

DT 18-DEC-2001 (first entry) 
XX 

DE Human PRO polypeptide sequence #3 6. 
XX 

KW PRO polypeptide; mammal; tumour; cancer; human; cattle; horse; sheep; 
KW dog; cat; pig; goat; rabbit; tumour necrosis factor alpha; TNF-alpha; 
KW blood; chondrocyte cell; cell proliferation; cell differentiation; colon; 
KW adrenal; lung; breast; prostate; rectum; cervix; liver; genetic disorder. 
XX 

OS Homo sapiens. 
XX 

PN WO200168848-A2. 
XX 



PD 


20 


-SEP- 


-2001 






XX 












PF 


28 


-FEB- 


-2001; 2001WO- 


■US006520. 


XX 












PR 


01 


-MAR- 


■2000, 


; 2000WO- 


■US005601. 


PR 


02 


-MAR- 


-2000, 


; 2000WO- 


US005841. 


PR 


03 


-MAR- 


-2000, 


? 2000US- 


■0187202P. 


PR 


06 


-MAR- 


-2000, 


; 2000US- 


•0186968P. 


PR 


14 


-MAR- 


-2000, 


; 2000US- 


0189320P. 


PR 


14 


-MAR- 


-2000, 


; 2000US- 


■0189328P. 


PR 


15 


-MAR- 


-2000, 


r 2000WO- 


US006884. 


PR 




-MAR- 


•2000, 


• 2000US- 


•0190828P. 


PR 


21 


-MAR- 


-2000, 


? 2000US- 


■0191007P. 


PR 


21 


-MAR- 


2000, 


: 2000US- 


0191048P. 


PR 


21 


-MAR- 


■2000, 


; 2000US- 


0191314P. 


PR 


28 


-MAR- 


■2000, 


; 2000US- 


0192655P. 


PR 


29 


-MAR- 


2000, 


r 2000US- 


0193032P. 


PR 


29 


-MAR- 


■2000, 


; 2000US- 


0193053P. 


PR 


30 


-MAR- 


■2000, 


r 2000WO- 


US008439. 


PR 


04 


-APR- 


2000, 


: 2000US- 


0194449P. 


PR 


04 


-APR- 


2000, 


r 2000US- 


0194647P. 


PR 


11 


-APR- 


2000, 


r 2000US- 


0195975P. 


PR 


11 


-APR- 


2000, 


; 2000US- 


0196000P. 


PR 


11 


-APR- 


2000, 


f 2000US- 


0196187P. 


PR 


11 


-APR- 


2000, 


r 2000US- 


0196690P. 


PR 


11 


-APR- 


2000; 


r 2000US- 


0196820P. 


PR 


18 


-APR- 


2000; 


: 2000US- 


0198121P. 


PR 


18 


-APR- 


2000; 


; 2000US- 


0198585P. 


PR 


25 


-APR- 


2000; 


: 2000US- 


0199397P. 


PR 


25 


-APR- 


2000; 


; 2000US- 


0199550P. 


PR 


25- 


-APR- 


2000; 


f 2000US- 


0199654P. 


PR 


03- 


-MAY- 


2000; 


r 2000US- 


0201516P. 



PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
XX 
PA 
XX 
PI 
PI 
XX 
DR 
DR 
XX 
PT 
PT 
PT 
XX 
PS 
XX 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
XX 
SQ 



17-MAY- 
2 2 -MAY- 
30-MAY- 
02-JUN- 
05-JUN- 
28-JUL- 
22-AUG- 
24-AUG- 
08-NOV- 
01-DEC- 
20-DEC- 



-2000 
•2000 
-2000 
-2000 
■2000 
■2000 
■2000 
2000 
2000 
2000 
2000 



2000WO- 

2000WO 

2000WO- 

2000WO- 

2000US- 

2000WO- 

2000US- 

2000WO- 

200 0WO- 

2000WO- 

2000WO- 



-US013705. 
•US014042 . 
■US014941. 
-US015264. 
0209832P. 
■US020710. 
•00644848. 
-US023328. 
•US030952. 
•US032678. 
■US034956. 



Gurney AL; 



(GETH ) GENENTECH INC. 

Baker KP, Chen J, Desnoyers L, Goddard A, Godowski PJ, 
Pan J, Smith V, Watanabe CK, Wood WI, Zhang Z; 

WPI; 2001-602746/68. 
N-PSDB; AAS45960. 



Novel nucleic acids encoding PRO polypeptides, used to diagnose the 
presence of tumors, such as prostate and breast tumors, in mammals and to 
screen for modulators of the compounds. 

Claim 11; Fig 72; 774pp; English. 

Sequences AAU29024-AAU29328 represent PRO polypeptides of the invention. 
The PRO polypeptides and their associated nucleic acids can be used to 
detect the presence of a tumour in a mammal by comparing the level of 
expression of a PRO polypeptide in a test sample of cells from the animal 
and a control sample of normal cells, whereby a higher level of 
expression in the test sample indicates the presence of a tumour in the 
mammal. Mammals include dogs, cats, cattle, horses, sheep, pigs, goats 
and rabbits but are preferably human. The polypeptides can be used to 
stimulate tumour necrosis factor (TNF) alpha release from human blood, 
when contacted with it. A specific polypeptide can be used to stimulate 
the proliferation or differentiation of chondrocyte cells. The PRO 
proteins can be used to determine the presence of tumours and also 
susceptibility to tumour development, particularly adrenal, lung, colon, 
breast, prostate, rectal, cervical, or liver tumours, in mammalian 
subjects. The oligonucleotide probes specific for the PRO nucleic acids 
can be used for genetic analysis of individuals with genetic disorders 

Sequence 518 AA; 



Query Match 100.0%; Score 2687; DB 4; Length 518; 

Best Local Similarity 100.0%; Pred. No. 8.6e-231; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALAKALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATN 60 

I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRVVAPTPGPGTPAERHADGL 60 

Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDS GRG YYLEML I GT P PQKLQI LVDTGS SNFAVAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I 

Db 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 12 0 



Qy 121 TPHS YI DT YFDTERS ST YRSKGFDVTVKYTQGSWTGFVGEDLVTI PKGFNTS FLVNI AT I 180 

I M I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | I I I I I I I I I I I I I I 

Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQ7\NIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENS S RS FRI TI LPQLYI QPMMGAGLNYEC YRFGI S P STNALVI GATVMEGFYVI FD 420 

I I I I II I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 YLRDENS SRS FRI TI LPQLYI QPMMGAGLNYECYRFGISPSTNALVI GATVMEGFYVI FD 420 

Qy 421 RAQKRVGF7\ASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPILWIVSYALMSVCG 480 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 421 RAQ KRVG FAAS P CAE I AGAAVS E I S G P FS T EDVAS N CVP AQ SLSEPILWI VS YALMS VC G 480 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 



RESULT 14 
AAU06602 

ID AAU06602 standard; protein; 518 AA. 
XX 

AC AAU06602; 
XX 

DT 24-OCT-2001 (first entry) 
XX 

DE Human Aspartyl protease 1 (Aspl) . 
XX 

KW Human; Aspartyl protease; Aspl; Asp2; beta-secretase; nootropic; 

KW neuroprotective; amyloid protein precursor; APP; Alzheimer's disease; 

KW amyloid-beta; Abeta. 

XX 

OS Homo sapiens. 
XX 

PN WO200149098-A2. 
XX 

PD 12-JUL-2001. 
XX 

PF 09-MAY-2001; 2001WO-IB000798 . 
XX 

PR 09-MAY-2001; 2001WO-IB000798 . 
XX 

PA (BIEN/) BIENKOWSKI M J. 

PA (GURN/) GURNEY M E. 

PA (HEIN/) HEINRIKSON R L. 

PA (PARO/) PARODI L A. 



PA (YANR/) YAN R. 
XX 

PI Bienkowski MJ, Gurney ME, Heinrikson RL, Parodi LA, Yan R; 
XX 

DR WPI; 2001-502549/55. 

DR N-PSDB; AAS11516. 
XX 

PT Novel purified polypeptide comprising fragment of mammalian aspartyl 

PT protease 2, lacking Asp2 transmembrane domain and retaining beta 

PT secretase activity of Asp2 useful for identifying inhibitors of Asp2 

PT activity. 

XX 

PS Example 2; Fig 1; 185pp; English. 
XX 

CC The invention relates to a purified polypeptide comprising a fragment of 

CC mammalian aspartyl protease (Asp) 2 protein which lacks the Asp2 

CC transmembrane domain and the Asp2 protein, and where the polypeptide and 

CC the fragment retain the beta-secretase activity of the mammalian Asp2 

CC protein. The invention also details polynucleotides for the Asp proteins 

CC and vectors expressing them, and a polypeptide (isoform of amyloid 

CC protein precursor (APP) ) comprising the amino acid sequence of an APP or 

CC its fragment containing an APP cleavage site recognizable by a mammalian 

CC beta-secretase, and further comprising two lysine residues at the 

CC carboxyl terminus of the amino acid sequence of the mammalian APP or APP 

CC fragment. Also included in the invention are methods of identifying 

CC modulators or inhibitors of Asp2 . Modulators and inhibitors of Asp2 are 

CC useful for treating Alzheimer's disease. APP is useful in methods for 

CC identifying inhibitors or modulators of human Asp2 activity and amyloid- 

CC beta (Abeta) peptide production. APP is also useful in designing 

CC therapeutics for the treatment or prevention of Alzheimer's disease. APP 

CC comprising the APP-Sw-beta-secretase peptide sequence (NLDA) , which is 

CC associated with increased levels of Abeta processing is useful in assays 

CC relating the Alzheimer's research. The expression vector is useful for 

CC recombinantly expressing APP. Nucleic acids that hybridise to Asp 

CC oligonucleotides are useful as probes or primers. The probes are useful 

CC for detecting Hu-Asp nucleic acids in in vitro assays and in Northern and 

CC Southern blots . The present sequence is human Aspl 

XX 

SQ Sequence 518 AA; 

Query Match 100.0%; Score 2687; DB 4; Length 518; 

Best Local Similarity 100.0%; Pred. No. 8.6e-231; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARALLL P LLAQWLL RAAP E LAP AP FT L P L RVAAATN RWAP T P G P GT P AE RHAD GL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MGALARAL L L P L LAQWLL RAAP E LAP AP FT L P L RVAAAT N RWAP T P G P GT P AE RHAD GL 60 

Qy 61 ALALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 12 0 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 



QY 



181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I ! I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I M I I I I I I I I I I M I I I I I I I 
Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENS S RS FRI T I L PQLYI QPMMGAGLN YEC YRFGI S P STNALVI GAT VMEGFYVI FD 420 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I II M I I I 
Db 361 YLRDENS SRS FRITI LPQLYIQPMMGAGLNYECYRFGI S PSTNALVI GATVMEGFYVI FD 420 

Qy 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE D VAS N C VP AQ SLSEPILWIVS YALMS VC G 480 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE DVASNC VP AQ S L S E P I LW I VS YALM S VC G 480 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 



RESULT 15 
ABB06531 

ID ABB06531 standard; protein; 518 AA. 
XX 

AC ABB06531; 
XX 

DT 31-MAY-2002 (first entry) 
XX 

DE Human aspartyl protease 1 protein SEQ ID NO: 125. 
XX 

KW Beta-secretase; enzyme; cleavage site; amyloid protein precursor; APP; 

KW aspartyl protease; neuroprotective; nootropic; beta-secretase inhibitor; 

KW Alzheimer's disease. 
XX 

OS Homo sapiens. 
XX 

PN WO200206306-A2. 
XX 

PD 24-JAN-2002. 
XX 

PF 19-JUL-2001; 2001WO-US023035 . 
XX 

PR 19-JUL-2000; 2000US-0219795P . 

PR 12-MAR-2001; 2001US-0275251P . 
XX 

PA (PHAA ) PHARMACIA & UPJOHN CO. 
XX 

PI Yan R, Tomasselli AG, Gurney ME, Emmons TL, Bienkowski MJ; 

PI Heinrikson RL; 

XX 

DR WPI; 2002-216995/27. 
XX 

PT Novel substrates for human aspartyl protease useful for identifying 



PT modulators of beta secretase activity of aspartyl protease for treating 

PT Alzheimer 1 s disease . 

XX 

PS Disclosure; Page 161-162; 188pp; English. 
XX 

CC The present invention describes an isolated peptide (I) comprising a 

CC sequence of at least four amino acids, where the peptide is a substrate 

CC for conducting aspartyl protease assays. (I) has neuroprotective and 

CC nootropic activities, and can be used as an inhibitor of beta-secretase 

CC activity. A beta-secretase modulator from the present invention can be 

CC used for inhibiting beta-secretase activity in vivo, and in the 

CC manufacture of a medicament for the treatment of Alzheimer 1 s disease, 

CC Pharmaceutical compositions from the present invention can be used for 

CC treating a disease or condition characterised by an abnormal beta- 

CC secretase activity. (I) is useful for identifying agents that modulate 

CC the activity of human Asp2 aspartyl protease (Hu-Asp2) . (I) is useful as 

CC a core structure to construct derivatives. ABL49914 to ABL49925 and 

CC ABB06409 to ABB06593 represent sequences used in the exemplification of 

CC the present invention 

XX 

SQ Sequence 518 AA; 

Query Match 100.0%; Score 2687; DB 5; Length 518; 

Best Local Similarity 100.0%; Pred. No. 8.6e-231; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARALL L P L LAQW L L RAAP ELAP AP FT L P L RVAAATNRWAPT P G P GT PAE RHADGL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRVVAPTPGPGT PAE RHADGL 60 

Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 FESENFFLPGI KWNGI LGLAYATLAKPS S S LETFFDSLVTQANI PNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVI)SGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQIACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

Qy 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS N C VP AQ S L S E P I LW I VS YALMS VCG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 RA.QKRVG FAAS PCAEI AGAAVS EI SGPFSTEDVASNCVPAQSLSEP I LWIVSYALMSVCG 480 



Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I M I I I 

Db 481 AILLVLIVLLLLPFRCQRRPRDPEVWDESSLVRHRWK 518 

Search completed: February 28, 2004, 09:09:16 
Job time : 64.4242 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: February 28, 2004, 09:08:05 ; Search time 18.5 Seconds 

(without alignments ) 
1445.527 Million cell updates/sec 

Title: US-09-668-314C-2 
Perfect score: 2687 

Sequence: 1 MGALARALLLPLLAQWLLRA RPRDPEWNDESSLVRHRWK 518 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 389414 seqs, 51625971 residues 



Total number of hits satisfying chosen parameters: 389414 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Issued_Patents_AA: * 

1 : /cgn2_6/ptodata/2/iaa/5A_COMB.pep: * 

2 : /cgn2_6/ptodata/2/iaa/5B_COMB.pep : * 

3 : /cgn2_6/ptodata/2/iaa/ 6A_COMB . pep : * 

4 : /cgn2_6/ptodata/2/iaa/6B_COMB.pep: * 

5: /cgn2_6/ptodata/2/iaa/PCTUS_COMB.pep: * 

6: / cgn2_6/ptodata/2/iaa/backf ilesl .pep: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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US-08-999-723-2 

; Sequence 2, Application US/08999723A 

; Patent No. 6025180 

; GENERAL INFORMATION: 

; APPLICANT: Powell, David J. 

APPLICANT: Southan, Christopher 
; APPLICANT : Chapman, Conrad G. 
; APPLICANT: Evans, Joanne R. 
; TITLE OF INVENTION: ASP1 

FILE REFERENCE: GH702 62 
; CURRENT APPLICATION NUMBER: US/08/999, 723A 
; CURRENT FILING DATE: 1997-10-06 
; NUMBER OF SEQ ID NOS : 2 
; SOFTWARE : PatentlnVer. 2.0 
; SEQ ID NO 2 

LENGTH: 518 
TYPE: PRT 



; ORGANISM: Homo sapiens 
US-08-999-723-2 

Query Match 100.0%; Score 2687; DB 3; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.2e-243; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MG7VLARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Dfo 1 MGALARALLLPLIAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 18 0 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATL7VKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 

Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLA/XiGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENS SRS FRITI LPQLYIQPMMGAGLNYECYRFGI S PSTNALVI GATVMEGFYVI FD 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVI GATVMEGFYVI FD 420 

Qy 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS NCVP AQ S L S E P I LWI VS YALMS VC G 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS NCVP AQ S L S E P I LWI VS YALMS VC G 480 

Qy .481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 



RESULT 2 
US-09-434-427-2 

; Sequence 2, Application US/09434427 

; Patent No. 6162630 

; GENERAL INFORMATION: 

; APPLICANT: POWELL, DAVID J. 

; APPLICANT: SOUTHAN, CHRISTOPHER 

; APPLICANT: CHAPMAN, CONRAD G. 

; APPLICANT: EVANS, JOANNE R. 

; TITLE OF INVENTION: ASP1 

FILE REFERENCE: GH-70262-D1 
; CURRENT APPLICATION NUMBER: US/09/434,427 

CURRENT FILING DATE: 1999-11-04 



; EARLIER APPLICATION NUMBER: US 08/999,723 

EARLIER FILING DATE: 1997-10-06 
; EARLIER APPLICATION NUMBER: UK 9626022.9 
; EARLIER FILING DATE: 1996-12-14 
; NUMBER OF SEQ ID NOS : 2 

; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 2 

LENGTH: 518 
; TYPE: PRT 

ORGANISM: HOMO SAPIENS 
US-09-434-427-2 

Query Match 100.0%; Score 2687; DB 3; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.2e-243; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARAL L L P L LAQWL L RAAP ELAP AP FT L P L RVAAATN RWAPT P G P GT P AE RHAD GL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I M I 
Db 1 MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

Qy 61 ALAIjEPAIASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVWKYTQGSWTGFVGEDLVTIPKGFNTSFLWIATI 180 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 

Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I 
Db 181 FES ENFFLPGI KWNGI LGLAYATLAKP S S S LET FFDS LVTQANI PNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I 

Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I II I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENS S RS FRI T I LPQLYI QPMMGAGLN YEC YRFGI S P STNALVT GATVMEGF YVT FD 420 

I I I II I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I | I I I I I I I I | I I | | | 
Db 361 YLRDENS SRS FRITILPQLYIQPMMGAGLN YECYRFGI S PSTNALVI GATVMEGFYVT FD 420 

Qy 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS N CVP AQ S L S E P I LWI VS YALMS VC G 480 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

Db 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS N C VP AQ S L S E P I LWI VS YALMS VC G 480 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I IJ I 

Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 



RESULT 3 

US-09-548-372D-2 

; Sequence 2, Application US/09548372D 
; Patent No. 6420534 



; GENERAL INFORMATION: 

; APPLICANT: GURNEY ET AL. 

; TITLE OF INVENTION: ALZHEIMER 1 S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

; TITLE OF INVENTION: THEREOF 
; FILE REFERENCE: 29915/62801 

; CURRENT APPLICATION NUMBER: US/09/548 , 372D 
; CURRENT FILING DATE: 2000-04-12 
; PRIOR APPLICATION NUMBER: US 60/155,493 
; PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: US 09/404,133 
; PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: PCT/US99/2 088 1 
; PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: US 60/101,594 
; PRIOR FILING DATE: 1998-09-24 
; NUMBER OF SEQ ID NOS : 73 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 2 

LENGTH: 518 
TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-548-372D-2 



Query Match 100.0%; Score 2687; DB 4; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.2e-243; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARAL LLP LLAQWL L RAAP ELAPAP FT L P L RVAAATN RWAPT P G P GT P AE RHADG L 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MGAIARALLLPLLAQWLLRAAPEIAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I I I I I I I I I I I I I M I I I I I I I I II I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 

Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 

Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GS GTNGGSLVLGGI EPS LYKGDIWYTPI KEEWYYQI EI LKLEI GGQSLNLDCREYNADKA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 241 GSGTNGGSLVLGGI EPSLYKGDIWYTPI KEEWYYQI EI LKLEI GGQSLNLDCREYNADKA 300 



Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I II 

Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I II I I I I I II I I I I I I 

Db 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 



QY 



421 RAQ KRVGFAAS P CAE I AGAAVS EISGPFSTE DVAS N C VP AQ S L S E P I LW I VS YALMS VC G 480 



Db 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS N CVP AQ S L S E P I LW I VS YALMS VC G 480 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEVVNDESSLVRHRWK 518 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 



RESULT 4 

US-09-548-367D-2 

Sequence 2, Application US/09548367D 
Patent No. 6440698 
GENERAL INFORMATION: 
APPLICANT: GURNEY ET AL. 

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

; TITLE OF INVENTION: THEREOF 
FILE REFERENCE: 29915/6280H 

CURRENT APPLICATION NUMBER: US/09/548 , 367D 
CURRENT FILING DATE: 2000-04-12 
PRIOR APPLICATION NUMBER: US 60/155,493 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: US 09/404,133 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: PCT/US99/208 8 1 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: US 60/101,594 
PRIOR FILING DATE: 1998-09-24 
NUMBER OF SEQ ID NOS : 73 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 2 
LENGTH: 518 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-548-367D-2 

Query Match 100.0%; Score 2687; DB 4; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.2e-243; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPA£RH7\DGL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MGALARAL L L P L LAQWL L RAAP E LAPAP FT L P L RVAAATN RWAPT P GP GT P AE RHADGL 60 

Qy 61 ALALE PALAS PAGAAN FLAMVDNLQGDS GRGYYLEMLI GT P PQKLQI LVDTGS SNFAVAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 ALALE PALAS PAGAAN FLAMVDNLQGDS GRGYYLEMLI GT P PQKLQI LVDTGS SNFAVAG 120 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 FESENFFLPGI KWNG I L GLAYAT LAKP SSSLETFFDS LVT QAN I PN VFSMQMC GAG L P VA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 



Db 241 GSGTNGGSLVLGGI EPSLYKGDIWYT PI KEEWYYQI EI LKLEI GGQSLNLDCREYNADKA 300 

QY 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQIACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENS S RS FRI T I LPQLYI QPMMGAGLN YEC YRFGI S P STNALVI GATVMEGFYVI FD 420 

I I II I I I II I I I I I I I I I I I I I I M I I I I I M I I I I I I I I I II I I II I I I I I I I I I I I I I 

Db 361 YLRDENS S RS FRI TI LPQLYIQPMMGAGLNYEC YRFGI S P STNALVI GATVMEGFYVI FD 420 

Qy 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE D VAS NC VP AQ S L S E P I LW I VS YALM SVC G 480 

I i I M I I M I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I M I I I I | | | | | | | | | | | | | 

Db 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS N CVP AQ S L S E P I LW I VS YALMS VC G 480 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I II I I I M I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 



RESULT 5 

US-09-551-853D-2 

; Sequence 2, Application US/09551853D 

; Patent No. 6500667 

; GENERAL INFORMATION: 

; APPLICANT: GURNEY ET AL . 

; TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

TITLE OF INVENTION: THEREOF 
; FILE REFERENCE: 29915/6280L 

; CURRENT APPLICATION NUMBER: US/09/551, 853D 

; CURRENT FILING DATE: 2000-04-18 

; PRIOR APPLICATION NUMBER: US 60/155,493 

PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: US 09/404,133 
; PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: PCT/US99/20881 
; PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: US 60/101,594 
; PRIOR FILING DATE: 1998-09-24 
; NUMBER OF SEQ ID NOS : 73 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 2 

LENGTH: 518 
TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-551-853D-2 

Query Match 100.0%; Score 2687; DB 4; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.2e-243; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARALLLPLLAQWLLRAAPELAPAP FTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I 

Db 1 MGALARAL L L P L LAQWL L RAAP E LAP AP FT L PLRVAAATN RWAP T P G P GT PAE READ GL 60 



Qy 



61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 



Db 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 12 0 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I I I I I I I I I M I I II I I I I I I I I I I I I I I I II I I I I I I I I I I | I | M | | | | | I I I I I I 
Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLWQANIPNVFSMQMCGAGLPVA 24 0 

I I I I I I I I I I I I I I I I II I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II h I I I I I I I I I I I I I 

Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENS S RS FRI T I LPQL YI QPMMGAGLN YEC YRFGI S P STNALVI GAT VMEGFYVI FD 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 361 YLRDENS SRS FRIT ILPQLYIQPMMGAGLNYECYRFG1SPSTNAL VI GATVMEGFYVIFD 420 

Qy 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE D VASN CVP AQ S L S E P I LW I VS YALMS VCG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE D VASN CVP AQ S L S E P I LW I VS YALMS VCG 480 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 



RESULT 6 

US-09-215-450-19 

; Sequence 19, Application US/09215450 

; Patent No. 6635748 

; GENERAL INFORMATION: 

; APPLICANT: Giese, Klaus 

; APPLICANT: Xin, Hong 

; TITLE OF INVENTION: METASTATIC BREAST AND COLON CANCER REGULATED GENES 

; FILE REFERENCE: 1451.100 / 210030.447 

; CURRENT APPLICATION NUMBER: US/09/215, 450 

; CURRENT FILING DATE: 1998-12-17 

; NUMBER OF SEQ ID NOS : 27 

; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 19 
; LENGTH: 518 

TYPE: PRT 

ORGANISM: human 
US-09-215-450-19 

Query Match 100.0%; Score 2687; DB 4; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.2e-243; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHAiDGL 60 

1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Db 



1 MGALARAL LL P L LAQWL L RAAP E LAP AP FT L P LRVAAATN RWAPT P G P GT PAE RHAD GL 60 



Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I II I I I I 

Db 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I I I I I I I I M I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I 
Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLWQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 FES EN FFLP GI KWNGI LGLAYATLAKP S S S LET FFD S LVTQANI PNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I M I I I I 
Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I | | | | | | | | | | | | | | M I I I I II 

Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

Qy 421 RAQ K RVG FAAS P CAE I AGAAVS EISGPFSTE DVAS N C VP AQ S L S E P I LW I VS YALMS VCG 480 

I I I I M I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 

Db 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS NCVP AQ S L S E P I LW I VS YALMS VCG 480 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | 
Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 



RESULT 7 
US-09-717-432-2 

Sequence 2, Application US/09717432 
Patent No. 6291223 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



ZHU, YUAN 
LI, XIAOTONG 
CHRISTIE, GARY 
POWELL, DAVID J. 
TITLE OF INVENTION: Mouse Aspartic Secretase-1 (mASPl) 
FILE REFERENCE: GP-70663 

CURRENT APPLICATION NUMBER: US/ 09/717 , 432 
CURRENT FILING DATE: 2000-11-21 
PRIOR APPLICATION NUMBER: 60/166,974 
PRIOR FILING DATE: 1999-11-23 
NUMBER OF SEQ ID NOS : 2 

SOFTWARE: FastSEQ for Windows Version 3.0 
SEQ ID NO 2 
LENGTH: 514 
TYPE: PRT 

ORGANISM: MUS MUSCULUS 
US-09-717-432-2 



Query Match 89.1%; Score 2395; DB 3; Length 514; 

Best Local Similarity 88.6%; Pred. No. 5.6e-216; 

Matches 459; Conservative 20; Mismatches 35; Indels 4; Gaps 



l; 



Qy 1 MGALARAL L L P L LAQWL L RAAP E LAP AP FT L P L RVAAATN R WAP T P G P GT P AE RHAD GL 60 

I I I I I I I I I : I I II I I I I 1111111111:11 III : II I I I MM 

Db 1 MGAL L RAL L L L VLAQW L L S AVP ALAP AP FT L P LQVAGATNH RAS AVP GLGT P E L P RAD GL 60 

Qy 61 ALALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

I I M I I I : I I I I I I I I M M I I I I II I I I I I II I I I M : I I I I I I II I I I I I I I 

Db 61 ALALEPVRAT ANFIAMVDNLQGDSGRGYYLEMLIGTPPQKVQILVDTGSSNFAVAG 116 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I I I I M I I : I INI M I I II I I I I I II I I I I I M M I II II I I I I : I II I I I II I 
Db 117 APHSYIDTYFDSESSSTYHSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNSSFLVNIATI 176 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I II I I : : I I I || I I I I I I || 
Db 177 FESENFFLPGIKWNGILGLAYAALAKPSSSLETFFDSLVAQAKIPDIFSMQMCGAGLPVA 236 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I M I I I II I I I I I I I I I I II I I I I I I I I I M I I I II I I I I I I I I I : I I I I I I I II II I I 
Db 237 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQNLNLDCREYNADKA 296 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I M I I I : II I I I I I I I II I : I I I I I I I 
Db 297 IVDSGTTLLRLPQKVFDAVVEAVARTSLIPEFSDGFWTGAQLACWTNSETPWAYFPKISI 356 

Qy 361 YLRDENS SRS FRI TI LPQLYIQPMMGAGLN YECYRFGI S PSTNALVI GATVMEGFYVI FD 420 

I I I I I I : I I I M I I I I I I I I I I I I I I II II I I I M II I I I I I I M M I I I I I I I I : II 

Db 357 YLRDENAS RS FRI T I LPQLYI QPMMGAGFN YECYRFGI S S STNALVI GATVMEGFYWFD 416 

Qy 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFST ED VASN C VP AQ S L S E P I LW I VS YALMS VC G 480 

IMMMII MIMI I I I I I II I I I I I I : I I I M I I I : I : I I I I I II I I I I I I I I I 
Db 417 RAQ RRVG FAVS P CAE I E GT TVS EISGPFSTED I ASN CVP AQALN E P I LW I VS YALMS VC G 476 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I II I : I I I M I : I I II I II II I I I I I I I I I I 
Db 477 AILLVLILLLLLPLHCRHAPRDPEWNDESSLVRHRWK 514 



RESULT 8 
US-09-912-484-2 

; Sequence 2, Application US/09912484 

; Patent No. 6358725 

; GENERAL INFORMATION: 

; APPLICANT: Christie, Gary 

; APPLICANT: Li, Xiaotong 

; APPLICANT: Powell, David J. 

; APPLICANT: Zhu, Yuan 

; TITLE OF INVENTION: Mouse Aspartic Secretase-1 (mASPl) 

; FILE REFERENCE: GP-70663-D1 

; CURRENT APPLICATION NUMBER: US/09/912,484 

; CURRENT FILING DATE: 2001-07-25 

; PRIOR APPLICATION NUMBER: 60/166,974 

; PRIOR FILING DATE: 1999-11-23 



; PRIOR APPLICATION NUMBER: 09/717,432 
; PRIOR FILING DATE: 2000-11-21 
; NUMBER OF SEQ ID NOS : 2 

; SOFTWARE : FastSEQ for Windows Version 3.0 
; SEQ ID NO 2 

LENGTH: 514 

TYPE : PRT 

ORGANISM: MUS MUSCULUS 
US-09-912-484-2 

Query Match 89.1%; Score 2395; DB 4; Length 514; 

Best Local Similarity 88.6%; Pred. No. 5.6e-216; 

Matches 459; Conservative 20; Mismatches 35; Indels 4; Gaps 1; 

Qy 1 MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRVVAPTPGPGTPAERHADGL 60 

I I I I I I I I I : II I I I I I I I I I I I I I I I I : I I I I I : I I I I I I I I I 

Db 1 MG7VLLRALLLLVLAQWLLSAVPALAPAPFTLPLQVAGATNHRASAVPGLGTPELPRADGL 60 

Qy 61 ALALEPAIASPAGAANFLAMVTDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 12 0 

I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I 

Db 61 ALALEPVRAT AN FLAMVDNLQGD S GRG Y YL EML I GT P PQKVQ I LVDT GS SN FAVAG 116 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I II I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I 
Db 117 APHSYIDTYFDSESSSTYHSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNSSFLVNIATI 176 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I :: I I I I I I I I I I I I I 
Db 177 FESENFFLPGIKWNGILGLAYAALAKPSSSLETFFDSLVAQAKIPDIFSMQMCGAGLPVA 236 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 30 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I 

Db 237 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQNLNLDCREYNADKA 296 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II : I I I I I I II I I I I : I I I I I I I 
Db 297 IVDSGTTLLRLPQKVFDAWEAVARTSLIPEFSDGFWTGAQLACWTNSETPWAYFPKISI 356 

Qy 361 YLRDENS SRS FRITI LPQLYIQPMMGAGLNYECYRFGI S PSTNAL VI GATVMEGFYVI FD 420 

I I I I I I : I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I 

Db 357 YLRDENAS RS FRI TI LPQLYI QPMMGAGFN YECYRFGI S S STNALVI GATVMEGFYWFD 416 

Qy 421 RAQKRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS N C VP AQ S LS E P I LW I VS YALMS VC G 48 0 

I I I : I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I M I : I : I I I I I I I I I I I I I II I 

Db 417 RAQRRVGFAVSPCAEIEGTTVSEISGPFSTEDIASNCVPAQALNEPILWIVSYALMSVCG 476 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I I I : I II I I I : I I I I I I I I I I I I I I I I I I I 

Db 477 AILLVLILLLLLPLHCRHAPRDPEWNDESSLVRHRWK 514 



RESULT 9 
US-09-713-158-2 

; Sequence 2, Application US/09713158 
; Patent No. 6361975 
; GENERAL INFORMATION: 
; APPLICANT: ZHU, YUAN 



APPLICANT : LI, XIAOTONG 
APPLICANT: POWELL, DAVID J. 
APPLICANT: CHRISTIE, GARY 

TITLE OF INVENTION: MOUSE ASPARTIC SECRETASE-2 (MASP-2) 
FILE REFERENCE: GP-70660 

CURRENT APPLICATION NUMBER: US/09/713, 158 
CURRENT FILING DATE: 2000-11-15 
PRIOR APPLICATION NUMBER: 60/165,800 
PRIOR FILING DATE: 1999-11-16 
NUMBER OF SEQ ID NOS : 2 

SOFTWARE : FastSEQ for Windows Version 3.0 
SEQ ID NO 2 
LENGTH: 501 
TYPE: PRT 

ORGANISM: MUS MUSCULUS 
US-09-713-158-2 

Query Match 44.2%; Score 1186.5; DB 4; Length 501; 

Best Local Similarity 46.0%; Pred. No. 1.4e-102; 

Matches 238; Conservative 83; Mismatches 167; Indels 29; Gaps 8; 

Qy 7 ALLLPLLAQWLLRAAPELAPAPFT L P L RVAAATN RWAP T P G P GT P AERHAD GLA 61 

I I I I I : : I I I I I I I I II II : 

Db 2 AQALPWLLLWV GSGMLPAQGTHLGIRLPLRSGLA GPPLGLRLPRETDEES — 51 

Qy 62 LALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGT 121 

I : I : I I I I I : I I I : I I I : I I : I : I I I I I I I I I I I I I I I I 

Db 52 EEPGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVGAA 104 

Qy 122 PHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIF 181 

II:: I : : I I I I I I I I I I I I I : I I I I : I I I I : I I I I 

Db 105 PHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAIT 164 

Qy 182 ESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPV — 239 

II: II: I I I I I I I I I I : I : I III I I I I I I I : I I I : I I : I : I I I I I : 
Db 165 ESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHIPNIFSLQLCGAGFPLNQ 224 

Qy 240 -AGS GTNGGS LVLGGI EP S L YKGDI WYT PI KEEW YYQI EI LKLEI GGQS LNLDCRE YNAD 298 

: I I I : : : I I I : III I : I I I I I : MM:: | : : : | | I I I : | | : | | | I 
Db 225 TEALASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYD 284 

Qy 299 KAIVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKI 358 

I : I I I I I I I I I I I : I I I : I I : : : II : I I I I I I I I I I II I : I I I 

Db 285 KSIVDSGTTNLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVI 344 

Qy 359 S I YLRDENS S RS FRI T I LPQL YI QPMMGAGLN Y- EC YRFGI S PSTNALVI GATVMEGFYV 417 

I : I I I : : : | | | | | | | | | I : : I : : : I I : I : I I : I : I I : I I I I I I 

Db 345 SLYLMGEVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGTVMGAVIMEGFYV 404 

Qy 418 I FD RAQ KRVGFAAS P CAE I AGAAVS EISGPFSTE DVAS NC VP AQ S L S E P I LW I VS YALMS 477 

: I I I I : I I : I I I I I : : I I I I I : II : : I : : 

Db 405 VFDRARKRIGFAVSACH^DEFRTAAVEGPFWADMEDCGYNIPQTDESTLMTIAYVMAA 464 

Qy 478 VCGAILLVLIVLLLLPFRCQR — RPRDPEWNDESSL 512 

: I I : : : : I : : : I I I I : : : I I I 

Db 465 IC-ALFMLPLCLMVCQWRCLRCLRHQHDDFADDISLL 500 



RESULT 10 
US-09-548-372D-8 

; Sequence 8, Application US/09548372D 

; Patent No. 6420534 

; GENERAL INFORMATION : 

; APPLICANT: GURNEY ET AL. 

; TITLE OF INVENTION : ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

; TITLE OF INVENTION: THEREOF 
; FILE REFERENCE: 29915/62801 

; CURRENT APPLICATION NUMBER: US/09/548, 372D 

; CURRENT FILING DATE: 2000-04-12 

; PRIOR APPLICATION NUMBER: US 60/155,493 

; PRIOR FILING DATE: 1999-09-23 

; PRIOR APPLICATION NUMBER: US 09/404,133 

; PRIOR FILING DATE: 1999-09-23 

; PRIOR APPLICATION NUMBER: PCT/US99/2088 1 

; PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: US 60/101,594 
; PRIOR FILING DATE: 1998-09-24 
; NUMBER OF SEQ ID NOS : 73 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 8 

LENGTH: 501 
TYPE: PRT 

ORGANISM: Mus rnusculus 
US-09-548-372D-8 



Query Match 44.1%; Score 1185; DB 4; Length 501; 

Best Local Similarity 46.0%; Pred. No. 1.9e-102; 

Matches 237; Conservative 83; Mismatches 169; Indels 26; Gaps 7; 

Qy 9 LLPLLAQWLLRAAPELAPAPFT L P L RVAAATN RWAP T P G P GT P AE RHAD GLALA 63 

: I I II : II I I I I I I II || : 
Db 1 MAPALHWLLLWVGSGMLPAQGTHLGIRLPLRSGLA GPPLGLRLPRETDEES 51 

Qy 64 LE PALAS PAGAANFLAMVDNLQGDSGRGYYLEMLI GT P PQKLQI LVDTGS SNFAVAGT PH 123 

I : I : I I I I I : I I I : I I I : I I : I : I I I I I I I I I I I I II I I II 

Db 52 EEPGRRGS FVEMVDNLRGKSGQGYYVEMTVGS PPQTLNI LVDTGS SNFAVGAAPH 106 



Qy 124 SYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFES 183 

: : I : : I I I II I I I I I I I I : I I I I : I I I I : II I I I I 

Db 107 PFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAITES 166 

Qy 184 ENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPV A 240 

: II: I I I I I I I I I I : I : I III I I I I I I I : I I I : I I : I : I I I I I : 
Db 167 DKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHIPNIFSLQLCGAGFPLNQTE 226 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

: I I I : : : I I I : III I : I I I I I : MM:: | : : : | | | | | : | | : I I I I I : 
Db 227 ALASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKS 286 



Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

.1 I I I I I I I I I I : I I I : I I :: : II : I I I I I I I I I I I I I : I I | I : 

Db 287 IVDSGTTNLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISL 346 



Qy 361 YLRDENS SRS FRI T I LPQLYIQPMMGAGLN Y- ECYRFGI S P STNALVI GATVMEGFYVI F 419 

II I : : : I I I I I I I I I I : : I : : : I I : I : I I : I : I I : II I I I I : I 

Db 347 YLMGEVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGTVMGAVIMEGFYWF 406 

Qy 420 D RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE D VAS N C VP AQ S L S E P I LW I VS YALMS VC 479 

I I I : I I : I I I I I : : | | | | | : | | ::[::: 

Db 407 D RARKRI G FAVS AC HVH D E FRT AAVE G P FVT ADMEDCG YN IPQTDEST LMT I AYVMAAI C 466 

Qy 480 GAILLVLIVLLLLPFRCQR--RPRDPEWNDESSL 512 

I : : : : I : : : I I I I : : : I I I 
Db 4 67 -ALFMLPLCLMVCQWRCLRCLRHQHDDFADDISLL 500 



RESULT 11 
US-09-548-367D-8 

; Sequence 8, Application US/09548367D 

; Patent No. 6440698 . 

; GENERAL INFORMATION: 

; APPLICANT: GURNEY ET AL. 

TITLE OF INVENTION: ALZHEIMER 1 S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

; TITLE OF INVENTION: THEREOF 

FILE REFERENCE: 29915/6280H 
; CURRENT APPLICATION NUMBER: US/09/548, 367D 
; CURRENT FILING DATE: 2000-04-12 
; PRIOR APPLICATION NUMBER: US 60/155,493 
; PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: US 09/404,133 
; PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: PCT/US99/2088 1 
; PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: US 60/101,594 

PRIOR FILING DATE: 1998-09-24 
; NUMBER OF SEQ ID NOS : 73 
; SOFTWARE: Patent In version 3.1 
; SEQ ID NO 8 

LENGTH: 501 
TYPE: PRT 

ORGANISM: Mus mus cuius 
US-09-548-367D-8 



Query Match 44.1%; Score 1185; DB 4; Length 501; 

Best Local Similarity 46.0%; Pred. No. 1.9e-102; 

Matches 237; Conservative 83; Mismatches 169; Indels 26; Gaps 7; 

Qy 9 LLPLLAQWLLRAAPELAPAPFT L P LRVAAATNRWAPT P GP GT PAERHADGLALA 63 

: I I II : I I I I I I I I II II : 

Db 1 MAPALHWLLLWVGSGMLPAQGTHLGIRLPLRSGLA GPPLGLRLPRETDEES 51 

Qy 64 LEPALAS PAGAAWFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPH 123 

I : I : I I I I I : I I I : I I I : I I : I : I I I I I I I I I I I I I I I I II 

Db 52 EEPGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPH 106 

Qy 124 S YI DTYFDTERSST YRSKGFDWVKYTQGSWTGFVGEDLVTI PKGFNTS FLVNI ATI FES 183 

: : I : : I I I I I I I I I I I I I : I I I I : I I I I : I I I I I I 

Db 107 PFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAITES 166 



Qy 184 ENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPV A 240 

: II: I I M I I I I I I : I : I III I I I I I I I : I I I : I I : I : I I I I I : 
Db 167 DKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHIPNIFSLQLCGAGFPLNQTE 226 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

: I I I : : : I I h III I : I I I I I : MM:: I : : : I I I I : I I : I I I I I : 

Db 227 ALASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKS 286 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

II I I I I I I I I I : II I : I I :: : II : I I I I I I I I I I I I I : I I I I : 

Db 287 IVDSGTTNLRLPKKVFEAAVKS I KAAS STEKFPDGFWLGEQLVCWQAGTTPWNI FPVI SL 346 

Qy 361 YLRDENS SRS FRITI LPQLYIQPMMGAGLNY-ECYRFGI S PSTNALVI GATVMEGFYVI F 419 

II I : : : I I I I II I I I I : : I : : : I I : I : I I : I : I I : I I I I II : I 

Db 347 YLMGEVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGTVMGAVIMEGFYWF 406 

Qy 420 D RAQKRVGFAAS P CAE I AGAAVS E I S GP FS T EDVASNCVPAQ S LS E P I LWI VS YALMS VC 479 

I I I : I I : I I I I I : : I I I I I : I I : : | : : : | 

Db 407 D RARKR I G FAVS ACH VH D E FRTAAVE G P FVT ADME D C G YN IPQTDEST LMT I AYVMAAI C 466 

Qy 480 GAILLVLIVLLLLPFRCQR— RPRDPEWNDESSL 512 

I : :: : I :: : I I I I : : : I I I 

Db 467 -ALFMLPLCLMVCQWRCLRCLRHQHDDFADDISLL 500 



RESULT 12 
US-09-551-853D-8 

; Sequence 8, Application US/09551853D 

; Patent No. 6500667 

; GENERAL INFORMATION: 

; APPLICANT: GURNEY ET AL. 

; TITLE OF INVENTION: ALZHEIMER 1 S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

TITLE OF INVENTION: THEREOF 
; FILE REFERENCE: 29915/6280L 

; CURRENT APPLICATION NUMBER: US/09/551, 853D 

; CURRENT FILING DATE: 2000-04-18 

; PRIOR APPLICATION NUMBER: US 60/155,493 

PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: US 09/404,133 
; PRIOR FILING DATE: 1999-09-23 

PRIOR APPLICATION NUMBER: PCT/US99/2 08 8 1 
; PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: US 60/101,594 
; PRIOR FILING DATE: 1998-09-24 
; NUMBER OF SEQ ID NOS : 73 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 8 

LENGTH: 501 
; TYPE: PRT 

; ORGANISM: Mus mus cuius 
US-09-551-853D-8 

Query Match 44.1%; Score 1185; DB 4; Length 501; 

Best Local Similarity 46.0%; Pred. No. 1.9e-102; 

Matches 237; Conservative 83; Mismatches 169; Indels 26; Gaps 7; 



Qy 9 LLPLLAQWLLRAAPELAPAPFT LPLRVAAATNRWAPTPGPGTPAERHADGLALA 63 

: I I II : M I II I I I II II : 

Db 1 MAPALHWLLLWVGSGMLPAQGTHLGIRLPLRSGLA GPPLGLRLPRETDEES 51 

Qy 64 LEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPH 123 

I : I : I I I I I : I I I : I I I : I I : I : I I I I I I I I I I I I I I I I II 

Db 52 EEPGRRGS FVEMVDNLRGKS GQGYYVEMTVGS PPQTLNILVDTGS SNFAVGAAPH 106 

Qy 124 SYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFES 183 

: • I : : I I I II I I II I I I I : I I II : I I I I : I I I I I I 

Db 107 PFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAITES 166 

Qy 184 ENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPV A 240 

: II: I I I I I I I I I I : I : I III I I I I I I I : I I I : I I : I : I I I I I : 

Db 167 DKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHIPNIFSLQLCGAGFPLNQTE 226 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

: I I I : : : I I I : III I : I I I I I : MM:: I : : : I I I I I : | | : | | | | | : 
Db 227 ALASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKS 286 

Qy 301 IVDSGTTLLRLPQKVFDAWEAV7VRASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I : I I I : I I :: : II : I I I I I I I I I I I I I : I I I I : 

Db 287 IVDSGTTNLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISL 346 

Qy 361 YLRDENS S RS FRI T I L PQL YI QPMMGAGLN Y- EC YRFGI S PSTNALVI GATVMEGFYVI F 419 

II I : : : I I I I I I I I I I : : I : : : II : I : I I : I- : I I : I I I I I I : I 

Db 347 YLMGEVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGTVMGAVIMEGFYWF 4 06 

Qy 420 D RAQ K RVG FAAS P CAE I AGAAVS EISGPFSTE D VAS N CVP AQ S L S E P I LW I VS Y ALMS VC 479 

I I I : I I : I I I I I : : I I I I I : I I : : I : : : I 

Db 4 07 DRARKRIGFAVSACHVHDEFRTAAVEGPFVTADMEDCGYNIPQTDESTLMTIAYVMAAIC 4 66 

Qy 480 GAILLVLIVLLLLPFRCQR — RPRDPEWNDESSL 512 

I : : : : I :': : I I I I : : : I I I 
Db 4 67 -ALFMLPLCLMVCQWRCLRCLRHQHDDFADDI SLL 500 



RESULT 13 
US-09-724-566A-65 

Sequence 65, Application US/09724566A 
Patent No. 6627739 
GENERAL INFORMATION: 



Anderson, John P . 
Basi, Guriqbal 
Doane, Minh Tarn 
Frigon, No. 6627739mand 
John, Varghese 
Power, Michael 
Sinha, Sukanto 
Tatsuno, Gwen 
Tung, Jay 
Wang, Shuwen 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT: McConlogue, Lisa 

TITLE OF INVENTION: Beta-Secretase Enzyme Compositions and 

TITLE OF INVENTION: Methods 

FILE REFERENCE: 22 8-US-NEWC2 

CURRENT APPLICATION NUMBER: US/09/724 , 566A 



CURRENT FILING DATE: 2000-11-28 
PRIOR APPLICATION NUMBER: US 09/501,708 
PRIOR FILING DATE: 2000-02-10 
PRIOR APPLICATION NUMBER: 60/119,571 
PRIOR FILING DATE: 1999-02-10 
PRIOR APPLICATION NUMBER: 60/139,172 
PRIOR FILING DATE: 1999-06-15 
NUMBER OF SEQ ID NOS : 104 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 65 
LENGTH: 501 
TYPE: PRT 

ORGANISM: Mus mus cuius 
US-09-724-566A-65 

Query Match 44.1%; Score 1184.5; DB 4; Length 501; 

Best Local Similarity 45.9%; Pred. No. 2.1e-102; 

Matches 237; Conservative 84; Mismatches 17 0; Indels 25; Gaps 7; 

Qy 9 L L P L LAQ W L L RAAP E LAP AP FT LPLRVAAATNRWAPTPGPGTPAERHADGLALA 63 

: I I II : I I I I I I I I II II : 
Db 1 MAPALHWLLLWVGSGMLPAQGTHLGIRLPLRSGLA GPPLGLRLPRETDEES 51 

Qy 64 LEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPH 123 

I : I : I I I I I : I I I : I I I : I I : I : I I I I I I I I I II I I I I I II 

Db 52 EEPGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPH 106 

Qy 124 SYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFES 183 

: : I : : I I I I I I I I I I I I I : I I I I : I I I I : I I I I I I 

Db 107 PFLHRYYQRQLSSTYRDLRKGVWPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAITES 166 

Qy 184 ENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPV A 240 

: II: I I I I I I I I I I : I : I III I I I II I I : I I I : I I : I : I I I I I : 
Db 167 DKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHIPNIFSLQLCGAGFPLNQTE 226 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

: I I I : : : I I I : III I : I I I I I : MM:: I : : : I I | | | : I I : I M I I : 
Db 227 ALASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKS 286 

Qy 301 IVDSGTTLLRLPQKVFDAWEAV7VRASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I : I I I : I I :: : II : I II I I I I I I I I I I : I I I I : 

Db 287 IVDSGTTNLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISL 346 

Qy 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNY-ECYRFGISPSTNALVIGATVMEGFYVIF 419 

I I I : : : I I I I t I I I I I : : I : : : I I : I : I I : I : I I : I I I I I I : I 

Db 347 YLMGEVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGTVMGAVIMEGFYWF 4 06 

Qy 420 D RAQK RVG FAAS P CAE I AGAAVS EISGPFSTE D VAS N C VP AQ S L S E P I LW I VS YALMS VC 479 

I I I : I I : I II I I : : II I I I : I I : : I : : : I 

Db 407 DRARKRIGFAVSACHVHDEFRTAAVEGPFVTADMEDCGYNIPQTDESTLMTIAYVMAAIC 466 

Qy 480 GAILLVLIVLLLLPFRCQRRPR-DPEWNDESSLVR 514 

I : : : : I : : : I I I I : I : I I : : 
Db 467 -ALFMLPLCLMVCQWRCLRCLRHQHDDFGDDISLLK 501 



RESULT 14 



US-09-548-372D-4 

Sequence 4, Application US/09548372D 
Patent No. 6420534 
GENERAL INFORMATION: 
APPLICANT: GURNEY ET AL. 

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

TITLE OF INVENTION: THEREOF 
FILE REFERENCE: 29915/62801 

CURRENT APPLICATION NUMBER: US/09/54 8 , 372D 
CURRENT FILING DATE: 2000-04-12 
PRIOR APPLICATION NUMBER: US 60/155,493 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: US 09/404,133 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: PCT/US99/20881 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: US 60/101,594 
PRIOR FILING DATE: 1998-09-24 
NUMBER OF SEQ ID NOS : 73 
SOFTWARE: Patent In version 3.1 
SEQ ID NO 4 
LENGTH: 501 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-548-372D-4 

Query Match 43.9%; Score 1178.5; DB 4; Length 501; 

Best Local Similarity 46.2%; Pred. No. 7.8e-102; 

Matches 24 0; Conservative 82; Mismatches 164; Indels 33; Gaps 9; 

Qy 7 ALLLPLLAQWLLRAAPELAPAPFT LPLRVAAATNRWAPTPGPGTPAERHADGLA 61 

I I I I I : : I I I I I I I II II 

Db 2 AQALPWLLLWM GAGVLPAHGTQHGIRLPLRSGLG GAPL GLR 42 

Qy 62 LALE — PALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVA 119 

I I I : I : I I I I I : I I I : I I I : II : I : I I I I I I I I I I I I I I I I 

Db 4 3 L P RET D E E P E E P GRRG S FVEMVDN L RGK S GQ G Y YVEMT VG S P PQT LN I LVDT G S S N FAVG 102 

Qy 120 GTPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIAT 179 

II:: I : : I I I I I I I I I I I I I : I I I I : I I I I : III 

Db 103 AAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAA 162 

Qy 180 IFESENFFLPGIKWNGILGIAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPV 239 

I II: II: I I I I I I I I I I : I : I I I I I I I I I I I : : I I : I I : I : I I I I I : 

Db 163 ITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHVPNLFSLQLCGAGFPL 222 

Qy 240 AGS GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYN 296 

I : I II : : : I I I : III I : I I I I I : MM:: I : : : I I II I : I I : I I I 

Db 223 NQSEVLASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYN 282 

Qy 2 97 ADKAIVX)SGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFP 356 

I I : I I I I I I I I I M : I I I : I I : : : II : I I I I I I I I I I I II : I I 

Db 283 YDKSI VDSGTTNLRLPKKVFEAAVKS I KAAS STEKFPDGFWLGEQLVCWQAGTTPWNI FP 342 

Qy 357 KISIYLRDENSSRSFRITILPQLYIQPMMGAGLNY-ECYRFGISPSTNALVIGATVMEGF 415 

I I : I I I ::: I I I I I I I I I I : : I : : : I I : I II I : I : II : I I I I 



Db 343 VISLYLMGEVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAISQSSTGTVMGAVIMEGF 402 



Qy 416 YVI FD RAQ K RVG FAAS P CAE I AGAAVS EISGPFSTE DVASNCVP AQ S L S E P I LW I VS YAL 475 

I I : I I I I : I I : I I I I I : : I I I I I : I I : : I : 

Db 4 03 YWFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTLDMEDCGYNIPQTDESTLMTIAYVM 462 

Qy 476 MSVCGAILLVLIVLLLLPFRCQR — RPRDPEWNDESSL 512 

: : I I : : : : I : : : I I I I : : : I I I 

Db 463 AAIC-ALFMLPLCLMVCQWRCLRCLRQQHDDFADDISLL 500 



RESULT 15 
US-09-548-367D-4 

; Sequence 4, Application US/09548367D 

; Patent No. 6440698 

; GENERAL INFORMATION: 

; APPLICANT: GURNEY ET AL. 

; TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR 
AND USES 

TITLE OF INVENTION: THEREOF 
FILE REFERENCE: 29915/6280H 

CURRENT APPLICATION NUMBER: US/09/54 8 , 367D 
CURRENT FILING DATE: 2000-04-12 
PRIOR APPLICATION NUMBER: US 60/155,493 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: US 09/404,133 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: PCT/US99/2088 1 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: US 60/101,594 
PRIOR FILING DATE: 1998-09-24 
NUMBER OF SEQ ID NOS : 73 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 4 
LENGTH: 501 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-548-367D-4 

Query Match 43.9%; Score 1178.5; DB 4; Length 501; 

Best Local Similarity 46.2%; Pred. No. 7.8e-102; 

Matches 24 0; Conservative 82; Mismatches 164; Indels 33; Gaps 9; 

Qy 7 ALLLPLLAQWLLRAAPELAPAPFT L P LRVAAATN RWAPT P G P GT P AE RHADGLA 61 

I II I I : : I I I I I I I II II 
Db 2 AQALPWLLLWM GAGVLPAHGTQHGIRLPLRS.GLG GAPL GLR 42 

Qy 62 LALE — PALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVA 119 

I I I : I : I I I I I : I I I : II I : I I : I : I I I I I I I I I I I I I I I I 

Db 43 LPRETDEEPEEPGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVG 102 

Qy 120 GTPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIAT 179 

II:: I : : I I I I I I I I I I I I I : I I I I : I I I I : III 

Db 103 AAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAA 162 

Qy 180 IFESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPV 239 

I II: II: I I I I I I I I I I : I : I III I I I I I I I : : I I : I I : I : I I I I I : 



Db 163 ITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHVPNLFSLQLCGAGFPL 222 

Qy 240 AGS GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYN 296 

I : | I I : : : I I I : III I : II I I I : INI:: I : : : I I I I | : I I : I I I 

Db 223 NQSEVLASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYN 282 

Qy 297 ADKAIVDSGTTLLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFP 356 

I I : I I I I I I I I I I I : I I I : I I :: : II : I I I I I I I I I I I I I : I I 

Db 283 YDKSIVDSGTTNLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFP 342 

Qy 357 KI S I YLRDENS S RS FRI T I LPQL YI QPMMGAGLN Y- EC YRFGI S P STNALVI GATVMEGF 415 

11:11 I : : : I I I I I I I I I I : : | : : : I I : I I I I : I : I I : I II I 

Db 343 VISLYLMGEVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAISQSSTGTVMGAVIMEGF 402 

Qy, 416 YVI FDRAQKRVGFAAS PCAE I AGAAVS E I S G P FS T EDVASNCVPAQ S L S E P I LWI VS YAL 475 

I I : I I I I : I I : I I I I I : : I I I I I : I I : : j : 

Db 403 YWFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTLDMEDCGYNIPQTDESTLMTIAYVM 4 62 

Qy ■ 476 MSVCGAILLVLIVLLLLPFRCQR — RPRDPEWNDESSL 512 

: : I I : : : : I : : : I I I I : : : I I I 

Db 463 AAIC-ALFMLPLCLMVCQWRCLRCLRQQHDDFADDISLL 500 

Search completed: February 28, 2004, 09:12:45 
Job time : 19.5 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: February 28, 2004, 09:11:20 ; Search time 33.6364 Seconds 

(without alignments) 
3251.760 Million cell updates/sec 

Title: US-09-668-314C-2 
Perfect score: 2687 

Sequence:- 1 MGALARALLLPLLAQWLLRA RPRDPEWNDESSLVRHRWK 518 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 809742 seqs, 211153259 residues 

Total number of hits satisfying chosen parameters: 809742 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Published_Applications_AA: * 

1 : /cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB . pep : * 

2 : /cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep: * 

3 : /cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB . pep : * 

4 : /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep: * 

5 : /cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB.pep: * 

6: /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB.pep: * 

7 : /cgn2_6/ptodata/2/pubpaa'/US08jtfEW_PUB.pep: * 

8 : /cgn2_6/ptodata/2/pubpaa/US08_PUBCOMB.pep: * 

9: /cgn2_6/ptodata/2/pubpaa/US09A_PUBCOMB.pep: * 
10: /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB.pep: * 
11 : /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep: * 
12 : /cgn2_6/ptodata/2/pubpaa/US0 9__NEW_PUB.pep : * 
13 : /cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB.pep: * 
14 : /cgn2_6/ptodata/2/pubpaa/USl0B_PUBC0MB.pep: * 
15 : /cgn2_6/ptodata/2/pubpaa/US10C__PUBCOMB.pep: * 
16: /cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB.pep: * 
17 : /cgn2_6/ptodata/2/pubpaa/US60_NEW_PUB.pep: * 
18 : /cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB.pep: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


2687 


100. 


0 


518 


9 


US- 


09- 


794- 


927-2 


Sequence 


2, Appli 


2 


2687 


100. 


0 


518 


9 


us- 


09- 


795- 


847-2 


Sequence 


2, Appli 


3 


2687 


100. 


0 


518 


9 


us- 


09- 


794- 


743-2 


Sequence 


2, Appli 


4 


2687 


100. 


0 


518 


9 


US- 


09- 


794- 


748-2 


Sequence 


2, Appli 


5 


2687 


100. 


0 


518 


9 


us- 


09- 


794- 


925-2 


Sequence 


2, Appli 


6 


2687 


100. 


0 


518 


9 


US- 


09- 


215- 


450-19 


Sequence 


19, Appl 


7 


2687 


100. 


0 


518 


9 


us- 


09- 


681- 


442-2 


Sequence 


2, Appli 


8 


2687 


100. 


0 


518 


9 


us- 


09- 


978- 


295A-196 


Sequence 


196, App 


9 


2687 


100. 


0 


518 


9 


us- 


09- 


886- 


143-2 


Sequence 


2 , Appli 


10 


2687 


100. 


0 


518 


9 


us- 


09- 


978- 


697-196 


Sequence 


196, App 


11 


2687 


100. 


0 


518 


9 


us- 


09- 


978- 


192A-196 


Secruence 


196, App 


12 


2687 


100. 


0 


518 


9 


us- 


09- 


999- 


832A-196 


Sequence 


196, App 


13 


2687 


100. 


0 


518 


10 


us 


-09 


-978 


-189-196 


Sequence 


196, App 


14 


2687 


100. 


0 


518 


10 


us 


-09 


-978 


-608A-196 


Sequence 


196, App 


15 


2687 


100. 


0 


518 


10 


us 


-09 


-978 


-585A-196 


Sequence 


196, App 


16 


2687 


100. 


0 


518 


10 


us 


-09 


-978 


-191A-196 


Sequence 


196, App 


17 


■ 2687 


100. 


0 


518 


10 


us 


-09 


-978 


-403A-196 


Sequence 


196, App 


18 


2687 


100. 


0 


518 


10 


us 


-09 


-978 


-564A-196 


Sequence 


196, App 


19 


2687 


100. 


0 


518 


10 


us 


-09 


-999 


-833A-196 


Sequence 


196, App 


20 


2687 


100. 


0 


518 


10 


us 


-09 


-981 


-915A-196 


Sequence 


196, App 


21 


2687 


100. 


0 


518 


10 


us 


-09 


-978 


-824-196 


Sequence 


196, App 


22 


2687 


100. 


0 


518 


10 


us 


-09 


-918 


-585A-196 


Sequence 


196, App 


23 


2687 


100. 


0 


518 


10 


us 


-09 


-978 


-423A-196 


Sequence 


196, App 


24 


2687 


100. 


0 


518 


10 


us 


-09 


-978 


-193A-196 


Sequence 


196, App 


25 


2687 


100. 


0 


518 


10 


us 


-09 


-869 


-414-2 


Sequence 


2, Appli 


26 


2687 


100. 


0 


518 


10 


us 


-09 


-999 


-830A-196 


Sequence 


196, App 


27 


2687 


100. 


0 


518 


10 


us 


-09 


-978 


-757A-196 


Sequence 


196, App 


28 


2687 


100. 


0 


518 


10 


us 


-09 


-978 


-187B-196 


Sequence 


196, App 


29 


2687 


100. 


0 


518 


10 


us 


-09 


-548 


-366-2 


Sequence 


2, Appli 


30 


2687 


100. 


0 


518 


10 


us 


-09 


-978 


-643A-196 


Sequence 


196, App 


31 


2687 


100. 


0 


518 


10 


us 


-09 


-978 


-375A-196 


Sequence 


196, App 


32 


2687 


100. 


0 


518 


10 


us 


-09 


-978 


-298A-196 


Sequence 


196, App 


33 


2687 


100. 


0 


518 


10 


us 


-09 


-978 


-188A-196 


Sequence 


196, App 


34 


2687 


100. 


0 


518 


10 


us 


-09 


-978 


-681A-196 


Sequence 


196, App 


35 


2687 


100. 


0 


518 


10 


us 


-09 


-978 


-194A-196 


Sequence 


196, App 


36 


2687 


100. 


0 


518 


10 


us 


-09 


-999 


-829A-196 


Sequence 


196, App 


37 


2687 


100. 


0 


518 


10 


us 


-09 


-978 


-299A-196 


Sequence 


196, App 


38 


2687 


100. 


0 


518 


10 


us 


-09 


-978 


-544A-196 


Sequence 


196, App 


39 


2687 


100. 


0 


518 


10 


us 


-09' 


-978 


-665A-196 


Sequence 


196, App 


40 


2687 


100. 


0 


518 


10 


us 


-09 


-978 


-802A-196 


Sequence 


196, App 


41 


2687 


100. 


0 


518 


13 


us 


-10 


-052 


-586-72 


Sequence 


72, Appl 


42 


2687 


100. 


0 


518 


14 


us 


-10 


-174 


-590-72 


Sequence 


72, Appl 


43 


2687 


100. 


0 


518 


14 


us 


-10' 


-176 


-758-72 


Sequence 


72, Appl 


44 


2687 


100. 


0 


518 


14 


us 


-10 


-175 


-737-72 


Sequence 


72, Appl 


45 


2687 


100. 


0 


518 


14 


us 


-10 


-173 


-706-72 


Sequence 


72, Appl 



ALIGNMENTS 



RESULT 1 
US-09-794-927-2 

; Sequence 2, Application US/09794927 

; Patent No. US20010016324A1 

; GENERAL INFORMATION: 

; APPLICANT: Gurney, Mark E. 



APPLICANT: Bienkowski, Michael J. 
APPLICANT: Heinrikson, Robert L. 
APPLICANT: Parodi, Luis A. 
APPLICANT: Yan, Riqiang 

TITLE OF INVENTION: ALZHEIMER 1 S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, 
AND 

TITLE OF INVENTION: USES 
TITLE OF INVENTION: THEREFOR 
FILE REFERENCE: 28341/6280FG 
CURRENT APPLICATION NUMBER: US/09/794,927 
CURRENT FILING DATE: 2001-02-27 
PRIOR APPLICATION NUMBER: 09/416,901 
PRIOR FILING DATE: 1999-10-13 
PRIOR APPLICATION NUMBER: 60/155,493 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 09/404,133 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: PCT/US99/20881 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 60/101,594 
PRIOR FILING DATE: 1998-09-24 
NUMBER OF SEQ ID NOS : 73 
SOFTWARE: Patentln Ver. 2.0 
SEQ ID NO 2 
LENGTH: 518 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-794-927-2 

Query Match 100.0%; Score 2687; DB 9; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.8e-240; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARALLLPLIAQWLLR7UVPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MGALARAL L L P L LAQWL L RAAP E LAP AP FT L P L RVAAATN RWAP T P GP GT P AE RHAD GL 60 

Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I 

Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I M I I I II I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I M I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 



Qy 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

Qy 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS N CVPAQ S L S E P I LW I VS YALMS VC G 480 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 RAQKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPILWIVSYALMSVCG 480 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I I I I I I I I I I I I I I I I II I I I M I I I I I I I II I I 
Db 481 AI LLVLI VLLLLP FRCQRRP RDPEWNDES S LVRHRWK 518 



RESULT 2 
US-09-795-847-2 

Sequence 2, Application US/09795847 
Patent No. US20010018208A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Gurney, Mark E. 
Bienkowski, Michael J. 
Heinrikson, Robert L. 
Parodi, Luis A. 
Yan, Riqiang 

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, 
AND 

TITLE OF INVENTION: USES 
TITLE OF INVENTION: THEREFOR 
FILE REFERENCE: 28341/6280DE 
CURRENT APPLICATION NUMBER: US/09/795, 847 
CURRENT FILING DATE: 2001-02-28 
PRIOR APPLICATION NUMBER: 09/416,901 
PRIOR FILING DATE: 1999-10-13 
PRIOR APPLICATION NUMBER: 60/155,493 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 09/404,133 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: PCT/US99/20881 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 60/101,594 
PRIOR FILING DATE: 1998-09-24 
NUMBER OF SEQ ID NOS : 73 
SOFTWARE: Patentln Ver. 2.0 
SEQ ID NO 2 
LENGTH: 518 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-795-847-2 



Query Match 100.0%; Score 2687; DB 9; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.8e-240; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARALLLPLIAQWLLRAAPEIAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I II 

Db 1 MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 



Qy 



61 ALALEPALAS PAGAAN FLAMVDNLQGDS GRGYYLEMLI GTPPQKLQ I LVDTGS SNFAVAG * 120 



II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I II II I I I I I I I I I I I I I I I I I I I II I I I I I 

Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTN5ETPWSYFPKISI 360 

Qy 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 YLRDENS SRS FRITI LPQLYIQPMMGAGLNYECYRFGI S PSTNALVI GATVMEGFYVI FD 420 

Qy 421 RAQKRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS N CVP AQ S L S E P I LWI VS YALMS VC G 480 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db ' 421 RAQKRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS N C VP AQ SLSEPILWI VS YALMS VC G 480 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 



RESULT 3 
US-09-794-743-2 

; Sequence 2, Application US/09794743 
Patent No. US20010021391A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Gurney, Mark E. 
Bienkowski, Michael J. 
Heinrikson, Robert L. 
Parodi, Luis A. 
Yan f Riqiang 

; TITLE OF INVENTION: ALZHEIMER 1 S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, 
AND 

TITLE OF INVENTION: USES 
TITLE OF INVENTION: THEREFOR 
FILE REFERENCE: 28341/6280BC 
CURRENT APPLICATION NUMBER: US/09/794,743 
CURRENT FILING DATE: 2001-02-27 
PRIOR APPLICATION NUMBER: 09/416,901 
PRIOR FILING DATE: 1999-10-13 
PRIOR APPLICATION NUMBER: 60/155,493 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 09/404,133 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: PCT/US99/20881 
PRIOR FILING DATE: 1999-09-23 



PRIOR APPLICATION NUMBER: 60/101,594 
; PRIOR FILING DATE: 1998-09-24 
; NUMBER OF SEQ ID NOS : 73 

SOFTWARE: PatentlnVer. 2.0 
; SEQ ID NO 2 

LENGTH: 518 
; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-09-794-743-2 

Query Match 100.0%; Score 2687; DB 9; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.8e-240; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARAL LLP LLAQWL L RAAP E LAP AP FT L P L RVAAATNRWAP T P G P GT PAE RHAD GL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MGAIAJRALLLPLIAQWLLRAAPEIAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 12 0 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLWIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 

Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I I I II M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II M I I I 

Db 301 IVDSGTTLLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENS SRS FRITI LPQLYIQPMMGAGLNYECYRFGI S PSTNALVI GATVMEGFYVI FD 420 

I I I I I I I I II I II I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 

Db 361 YLRDENS S RS FRITI LPQLYI QPMMGAGLNYECYRFGI S P STNALVT GATVMEGFYVI FD 420 

Qy 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE D VAS N C VP AQ SLSEPILWI VS YALMS VC G 48 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I 
Db 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS NC VP AQS L S E P I LW I VS YALMS VC G 480 

Qy 4 81 AI LLVLI VLLLLPFRCQRRPRDPEWNDES S LVRHRWK 518 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 AI LLVLI VLLLLPFRCQRRPRDPEWNDES S LVRHRWK 518 



RESULT 4 
US-09-794-748-2 

; Sequence 2, Application US/09794748 

; Patent No. US20020037315A1 

; GENERAL INFORMATION: 

; APPLICANT: Gurney, Mark E. 



APPLICANT: Bienkowski, Michael J. 
APPLICANT: Heinrikson, Robert L. 
APPLICANT: Parodi, Luis A. 
APPLICANT: Yan, Riqiang 

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, 
AND 

TITLE OF INVENTION: USES 
TITLE OF INVENTION: THEREFOR 
FILE REFERENCE: 28341/6280JL 
CURRENT APPLICATION NUMBER: US/09/794,748 
CURRENT FILING DATE: 2001-02-27 
PRIOR APPLICATION NUMBER: 09/416,901 
PRIOR FILING DATE: 1999-10-13 
PRIOR APPLICATION NUMBER: 60/155,493 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 09/404,133 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: PCT/US99/208 81 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 60/101,594 
PRIOR FILING DATE: 1998-09-24 
NUMBER OF SEQ ID NOS : 73 
SOFTWARE: Patentln Ver. 2.0 
SEQ ID NO 2 
LENGTH: 518 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-794-748-2 

Query Match 100.0%; Score 2687; DB 9; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.8e-240; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARAL L L P LLAQWLLRAAP ELAP AP FT L P L RVAAATN RWAP T P G P GT P AE RHADGL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MGALARAL L L P L LAQW L LRAAP E LAP AP FT L P L RVAAAT N RWAP T P G P GT P AE RHADGL 60 

Qy 61 ALALE PALAS PAGAAN FLAMVDNLQGDS GRGYYLEMLI GT P PQKLQI LVDTGS SN FAVAG 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Ml I I I I I I I I I I I I I I I 
Db 61 ALALE PALAS PAGAAN FLAMVDNLQGDS GRGYYLEML I GTPPQKLQ I LVDTGSSN FAVAG 12 0 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 



Qy 361 YLRDENS S RS FRI T I LPQLYI QPMMGAGLN YECYRFGI S P STNALVI GATVMEGFYVI FD 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 

Db 361 YLRDENS SRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

QY 421 RAQ K RVG FAAS P CAE I AGAAVS EISGPFSTE D VAS NC VP AQ S L S E P I LW I VS YALMS VC G 480 

I M I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | M I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 RAQKRVGFT^ASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPILWIVSYALMSVCG 480 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 



RESULT 5 
US-09-794-925-2 

Sequence 2, Application US/09794925 
Patent No. US20020064819A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Gurney, Mark E. 
Bienkowski, Michael J. 
Heinrikson, Robert L. 
Parodi, Luis A. 
Yan, Riqiang 

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, 
AND USES 

; TITLE OF INVENTION: THEREFOR 
FILE REFERENCE: 28341/6280HI 
CURRENT APPLICATION NUMBER: US/09/794,925 
CURRENT FILING DATE: 2001-02-27 
PRIOR APPLICATION NUMBER: 09/416,901 
PRIOR FILING DATE: 1999-10-13 
PRIOR APPLICATION NUMBER: 60/155,493 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 09/404,133 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: PCT/US99/20881 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 60/101,594 
PRIOR FILING DATE: 1998-09-24 
NUMBER OF SEQ ID NOS : 7 3 
SOFTWARE: Patentln Ver. 2.0 
SEQ ID NO 2 
LENGTH : 518 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-794-925-2 



Query Match 100.0%; Score 2687; DB 9; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.8e-240; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGAL7VRALLLPLLAQWLLR7^PELAPAPFTLPLRVA7\ATNRWAPTPGPGTPAERHADGL 60 

I I I I I M I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

Qy 61 ALALE P ALAS P AGAAN FLAMVDN LQ G D S G RG Y YL EML I GT P P Q KLQ I L VDT G S S N FAVAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 



Db 



61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLI GT PPQKLQI LVDTGS SNFAVAG 12 0 



Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

M I I I I I I I I I I I II I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I i I II I I I I I I I I I 
Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

QY 181 FESENFFLPGIKWNGILGLAYATIAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I M I I I II I I I I I I I 
Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | 
Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDEN S S RS FRI T I LPQL YI QPMMGAGLN YEC YRFGI S P STNALVI GATVMEGFYVT FD 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 YLRDENS S RS FRI T I LPQL YI Q PMMGAGLNYEC YRFGI S P STNALVI GATVMEGFYVT FD 420 

Qy 421 RAQKRVG FAAS P CAE I AGAAVS E I SGP FSTEDVASNCVP AQ S LS E P I LWI VS YALMS VCG 480 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 RAQKRVG FAAS P CAE I AGAAVS EISGPFSTE D VAS N C VP AQ SLSEPILWI VS YALMS VC G 480 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

RESULT 6 

US-09-215-450-19 

; Sequence 19, Application US/09215450 

; Patent No. US20020068278A1 

; GENERAL INFORMATION: 

; APPLICANT: Giese, Klaus 

; APPLICANT: Xin, Hong 

; TITLE OF INVENTION: METASTATIC BREAST AND COLON CANCER REGULATED GENES 

; FILE REFERENCE: 1451.100 / 210030.447 

; CURRENT APPLICATION NUMBER: US/09/215,450 

; CURRENT FILING DATE: 1998-12-17 

; NUMBER OF SEQ ID NOS : 27 

; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 19 

LENGTH: 518 

TYPE: PRT 
; ORGANISM: human 
US-09-215-450-19 

Query Match 100.0%; Score 2687; DB 9; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.8e-240; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARALLLPLLAQWLLRAAPEIA.PAPFTLPLRVAAATNRVVAPTPGPGTPAERHADGL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MGALAR7aLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 



Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

I I I I I I I I I I I I I I I I I I I I II I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II 

Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENS SRS FRITI LPQLYIQPMMGAGLNYECYRFGI S PSTNALVI GATVMEGFYVI FD 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 YLRDENS SRS FRITI LPQLYIQPMMGAGLNYECYRFGI S PSTNALVI GATVMEGFYVI FD 420 

Qy 421 RAQK RVG FAAS P CAE I AGAAVS EISGPFSTE DVAS N CVP AQ S L S E P I LW I VS YALMS VC G 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 

Db 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS N C VP AQ S L S E P I LW I VS YALMS VCG 480 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I M I I I I I II I I I I I I I I M I I I I I I I I I I I I I I 
Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 



RESULT 7 
US-09-681-442-2 

Sequence 2, Application US/09681442 
Patent No. US20020081634A1 
GENERAL INFORMATION: 
APPLICANT: Gurney, Mark E. 
APPLICANT: Bienkowski, Michael J. 
APPLICANT: Heinrikson, Robert L. 
APPLICANT: Parodi, Luis A. 
APPLICANT: Yan, Riqiang 

TITLE OF INVENTION: ALZHEIMER'S DISEASE SECRETASE, APP SUBSTRATES THEREFOR, 
AND USES 

TITLE OF INVENTION: THEREFOR 
FILE REFERENCE: 28341/6280FG 
CURRENT APPLICATION NUMBER: US/09/681,442 
CURRENT FILING DATE: 2001-04-05 
PRIOR APPLICATION NUMBER: 09/416,901 
PRIOR FILING DATE: 1999-10-13 
PRIOR APPLICATION NUMBER: 60/155,493 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: 09/404,133 
PRIOR FILING DATE: 1999-09-23 
PRIOR APPLICATION NUMBER: PCT/US99/2088 1 



; PRIOR FILING DATE: 1999-09-23 
; PRIOR APPLICATION NUMBER: 60/101,594 
; PRIOR FILING DATE: 1998-09-24 
; NUMBER OF SEQ ID NOS : 73 
; SOFTWARE: Patentln Ver. 2.0 
; SEQ ID NO 2 
; LENGTH: 518 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-681-442-2 

Query Match 100.0%; Score 2687; DB 9; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.8e-240; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGAIARALLLPL^QWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGP 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
Db 1 MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRV7\AATNRWAPTPGPGTPAERHADGL 60 

Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDS GRGYYLEMLI GT P PQKLQ I LVDTGS SNFAVAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEML I GTP PQKLQ I LVDTGS SNFAVAG 120 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

M I I I I I I I M I II I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I I I I I I I I I I I I I I I I I II M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 

Db 181 FES ENFFLPGI KWNGI LGLAYATLAKP S SSLETFFDS LVTQANI PNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | 
Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 

Db 301 IVDSGTTLLRLPQKVFDAV\^^VARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDEN S S RS FRI T I LPQL YIQPMMGAGLN YEC YRFGI S P STNALVI GAT VMEGFYVI FD 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

Qy 421 RAQ KRVGFAAS P CAE I AGAAVS EISGPFSTE DVAS NCVP AQ S L S E P I LW I VS YALMS VC G 4 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I II I I I I I I I I I I I I 

Db 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS N CVP AQ S L S E P I LW I VS YALMS VC G 480 

Qy 481 AI LLVLI VLLLLPFRCQRRPRDPEWNDES S LVRHRWK 518 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I 

Db 4 81 AILLVLIVLLLLPFRCQRRPRDPEWNDESS LVRHRWK 518 



RESULT 8 

US-09-978-295A-19 6 

; Sequence 196, Application US/09978295A 
; Patent No. US20020156006A1 
; GENERAL INFORMATION: 



A PPT TPfiMT 


. AsnKenazi, Avi 


A PPT TPRMT 1 


tsaKer J\evin r. 


a PPT TPZiMT 


. tjoEsrein, jjavict 


Ar r LI CAN I 


: Desnoyers, Luc 


ZiPPT TPflMT 1 
J\n JT Xj X WAIN 1 


» LdLon, Dan 


Ar IT J_i 1 CAN 1 , 


: Ferrara, Napoleon 


A P P T T^S MT < 
Air Jr Xi X L.AINJ 1 . 


: riivaroir, jzjxxen 


7\DDT Tr*A'M r Pi 

Ar r J_i 1 CAN 1 I 


: r ong, bnerman 


7\ T) T-) x T /"* 7\ >.TTi ■ 

ArrLlLANI : 


: Gao, Wei-Qiang 


ArrLILAN Jl : 


: Gerber, Hanspeter 


A "DDT TPTiMT 1 . 

Af jtLxCANI I 


: Gerritsen, Mary E . 


ArFLI LAN 1 : 


: Goddard, Audrey 


ArPLICAN I J 


: Godowski, Paul J. 


APPLICANT : 


: Grimaldi , J . Christopher 


APPLICANT : 


: Gurney , Austin L . 


APPLICANT : 


: Hillan, Kenneth J 


A "DDT T f" 1 A \T T 1 ■ 

AjrirLxCANl. . 


: j\i]avin, ivar J . 


APPLICANT : 


: Kuo, Sophxa S . 


APPLICANT : 


: Napier, Mary A. 


APPLICANT: 


^Y 

: Pan, James ; 


APPLICANT: 


Paoni, Nicholas F. 


APPLICANT: 


Roy, Margaret Ann 


APPLICANT: 


Shelton, David L . 


APPLICANT: 


Stewart, Timothy A. 


APPLICANT: 


Tumas , Daniel 


APPLICANT: 


Williams , P . Mickey 


APPLICANT: 


Wood, William I. 



TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2630P1C11 

CURRENT APPLICATION NUMBER: US/09/978, 295A 
CURRENT FILING DATE: 2001-10-15 
PRIOR APPLICATION NUMBER: 09/918585 
PRIOR FILING DATE: 2001-07-30 
PRIOR APPLICATION NUMBER: 60/062250 
PRIOR FILING DATE: 1997-10-17 
PRIOR APPLICATION NUMBER: 60/064249 
PRIOR FILING DATE: 1997-11-03 
PRIOR APPLICATION NUMBER: 60/065311 
PRIOR FILING DATE: 1997-11-13 
PRIOR APPLICATION NUMBER: 60/066364 
PRIOR FILING DATE: 1997-11-21 
PRIOR APPLICATION NUMBER: 60/077450 
PRIOR FILING DATE: 1998-03-10 
PRIOR APPLICATION NUMBER: 60/077632 
PRIOR FILING DATE: 1998-03-11 
PRIOR APPLICATION NUMBER: 60/077641 
PRIOR FILING DATE: 1998-03-11 
PRIOR APPLICATION NUMBER: 60/077649 
PRIOR FILING DATE: 1998-03-11 
PRIOR APPLICATION NUMBER: 60/077791 
PRIOR FILING DATE: 1998-03-12 
PRIOR APPLICATION NUMBER: 60/078004 
PRIOR FILING DATE: 1998-03-13 
PRIOR APPLICATION NUMBER: 60/078886 
PRIOR FILING DATE: 1998-03-20 
PRIOR APPLICATION NUMBER: 60/078936 



PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 



1998-03-20 
NUMBER: 60/078910 

1998-03-20 
NUMBER: 60/078939 

1998-03-20 
NUMBER: 60/079294 

1998-03-25 
NUMBER: 60/079656 

1998-03-26 
NUMBER: 60/079664 

1998-03-27 
NUMBER: 60/079689 

1998-03-27 
NUMBER: 60/079663 

1998-03-27 
NUMBER: 60/079728 

1998-03-27 
NUMBER: 60/079786 

1998-03-27 
NUMBER: 60/079920 

1998-03-30 
NUMBER: 60/079923 

1998-03-30 
NUMBER: 60/080105 

1998-03-31 
NUMBER: 60/080107 

1998-03-31 
NUMBER: 60/080165 

1998-03-31 
NUMBER: 60/080194 

1998-03-31 
NUMBER: 60/080327 

1998-04-01 
NUMBER: 60/080328 

1998-04-01 
NUMBER: 60/080333 

1998-04-01 
NUMBER: 60/080334 

1998-04-01 
NUMBER: 60/081070 

1998-04-08 
NUMBER: 60/081049 

1998-04-08 
NUMBER: 60/081071 

1998-04-08 
NUMBER: 60/081195 

1998-04-08 
NUMBER: 60/081203 

1998-04-09 
NUMBER: 60/081229 

1998-04-09 
NUMBER: 60/081955 

1998-04-15 
NUMBER: 60/081817 

1998-04-15 
NUMBER: 60/081819 
1998-04-15 



PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 



NUMBER: 60/081952 

1998-04-15 
NUMBER: 60/081838 

1998-04-15 
NUMBER: 60/082568 

1998-04-21 
NUMBER: 60/082569 

1998-04-21 
NUMBER: 60/082704 

1998-04-22 
NUMBER: 60/082804 

1998-04-22 
NUMBER: 60/082700 

1998-04-22 
NUMBER: 60/082797 

1998-04-22 
NUMBER: 60/082796 

1998-04-23 
NUMBER: 60/083336 

1998-04-27 
NUMBER: 60/083322 

1998-04-28 
NUMBER: 60/083392 

1998-04-29 
NUMBER: 60/083495 

1998-04-29 
NUMBER: 60/083496 

1998-04-29 
NUMBER: 60/083499 

1998-04-29 
NUMBER: 60/083545 

1998-04-29 
NUMBER: 60/083554 

1998-04-29 
NUMBER: 60/083558 

1998-04-29 
NUMBER: 60/083559 

1998-04-29 
NUMBER: 60/083500 

1998-04-29 
NUMBER: 60/083742 

1998-04-30 
NUMBER: 60/084366 

1998-05-05 
NUMBER: 60/084414 

1998-05-06 
NUMBER: 60/084441 

1998-05-06 
NUMBER: 60/084637 

1998-05-07 
NUMBER: 60/084639 

1998-05-07 
NUMBER: 60/084640 

1998-05-07 
NUMBER: 60/084598 

1998-05-07 
NUMBER: 60/084600 



; PRIOR FILING DATE 
; PRIOR APPLICATION 
; PRIOR FILING DATE 
; PRIOR APPLICATION 
PRIOR FILING DATE 
PRIOR APPLICATION 
; PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
; PRIOR APPLICATION 
; PRIOR FILING DATE 
; PRIOR APPLICATION 
; PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
PRIOR APPLICATION 
; PRIOR FILING DATE 
PRIOR APPLICATION 
; PRIOR FILING DATE 
; PRIOR APPLICATION 
; PRIOR FILING DATE 
; PRIOR APPLICATION 



: 1998-05-07 

NUMBER: 60/084627 
: 1998-05-07 

NUMBER: 60/084643 
: 1998-05-07 

NUMBER: 60/085339 
: 1998-05-13 

NUMBER: 60/085338 
: 1998-05-13 

NUMBER: 60/085323 
: 1998-05-13 

NUMBER: 60/085582 
: 1998-05-15 

NUMBER: 60/085700 
: 1998-05-15 

NUMBER: 60/085689 
: 1998-05-15 

NUMBER: 60/085579 
: 1998-05-15 

NUMBER: 60/085580 
: 1998-05-15 

NUMBER: 60/085573 
: 1998-05-15 

NUMBER: 60/085704 
: 1998-05-15 

NUMBER: 60/085697 



Query Match 100.0%; Score 2687; DB 9; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.8e-240; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARAL L L P LLAQWL LRAAP E LAP AP FT L P L RVAAATN RWAPT P G P GT P AE READ GL 60 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MGALARAL L L P L LAQWL L RAAP E LAP AP FT L P L RVAAATN RWAP T P G P GT P AE RHAD G L 60 

Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 12 0 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
Db 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 12 0 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 18 0 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I M I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I I I I I I I I I I I I I M I I I II I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 

Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 IVDSGTTLLRLPQKVFDAVVKAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 



Qy 



361 YLRDENS S RS FRI T I LPQLYI Q PMMGAGLN YEC YRFGI S PSTNALVI GATVMEGFYVI FD 420 
II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 



Db 



361 YLRDENS SRS FRITI LPQLYIQPMMGAGLNYECYRFGI S PSTNALVI GATVMEGFYVI FD 42 0 



Qy 421 RAQ KRVG FAAS P CAE I AGAAVS E I S G P F S T EDVAS N C VP AQ SLSEPILWIVS YALMS VC G 480 

I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 

Db 421 RAQKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPILWIVSYALMSVCG 480 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 



RESULT 9 
US-09-886-143-2 

Sequence 2, Application US/09886143 
Patent No. US20020159991A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Cordell, Barbara 
Schimmoller , Frauke 
Liu, Yu-Wang 
Quon, Diana Horn 
TITLE OF INVENTION: Modulation of A Levels by 
TITLE OF INVENTION: Secretase BACE2 
FILE REFERENCE: SCIOS.022A 

CURRENT APPLICATION NUMBER: US/ 09/886, 143 
CURRENT FILING DATE: 2001-06-20 
PRIOR APPLICATION NUMBER: 60/215,729 
PRIOR FILING DATE: 2000-06-28 
NUMBER OF SEQ ID NOS : 6 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 2 
LENGTH: 518 
TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-886-143-2 



Query Match 100.0%; Score 2687; DB 9; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.8e-240; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; 



Qy 

Db 



Gaps 



0; 



1 MGAliAI^ALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRVVAPTPGPGTPAERHADGL 60 

I I II I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1 MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERRADGL 60 



Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 



61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I II I I I I I I 
61 ALALE P ALAS PAGAAN FLAMVDNLQGD S GRG Y YLEML I GT P PQKLQ I LVDT GS SN FAVAG 120 

121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 18 0 

I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I 
121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

181 FESENFFLPGIKWNGILGLAYATL7VKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I 

181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 
I I II I I II I I I I I M II I I II I I I M I I I I I I I I I II I I I II I I I I I I I I II I I I II I I I 



Db 



241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 



Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I M II I I I I I I I I I I I I I 

Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

Qy 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS N C VP AQ S L S E P I LWI VS YALMS VC G 480 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | II I I I I I I 

Db 421 RAQKRVG FAAS P CAE I AGAAVS EISGPFSTE D VAS NCVP AQ S L S E P I LWI VS YALMS VC G 480 

Qy 481 AI LLVLI VLLLLPFRCQRRPRDPEWNDES SLVRHRWK 518 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II II 
Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDES SLVRHRWK 518 

RESULT 10 
US-09-978-697-196 

; Sequence 196, Application US/09978697 
; Patent No. US20020169284A1 
; GENERAL INFORMATION: 



APPLICANT 


: Ashkenazi, Avi 


APPLICANT 


: Baker Kevin P. 


APPLICANT 


: Botstein, David 


APPLICANT 


: Desnoyers, Luc 


APPLICANT 


: Eaton, Dan 


APPLICANT 


: Ferrara, Napoleon 


APPLICANT 


: Filvarof f , Ellen 


APPLICANT 


: Fong, Sherman 


APPLICANT 


: Gao, Wei-Qiang 


APPLICANT 


: Gerber, Hanspeter 


APPLICANT' 


: Gerritsen, Mary E. 


APPLICANT: 


: Goddard, Audrey 


APPLICANT: 


: Godowski, Paul J. 


APPLICANT: 


: Grimaldi, J. Christopher 


APPLICANT: 


; Gurney, Austin L. 


APPLICANT: 


Hillan, Kenneth J 


APPLICANT: 


Kljavin, Ivar J. 


APPLICANT: 


Kuo, Sophia S. 


APPLICANT: 


Napier, Mary A. 


APPLICANT: 


Pan, James; 


APPLICANT: 


Paoni, Nicholas F. 


APPLICANT: 


Roy, Margaret Ann 


APPLICANT: 


Shelton, David L. 


APPLICANT: 


Stewart, Timothy A. 


APPLICANT: 


Tumas , Daniel 


APPLICANT: 


Williams, P. Mickey 


APPLICANT: 


Wood, William I. 



TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2630P1C27 

CURRENT APPLICATION NUMBER: US/09/978,697 
CURRENT FILING DATE: 2001-10-16 
PRIOR APPLICATION NUMBER: 09/918585 



PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 



2001-07-30 
NUMBER: 60/062250 

1997-10-17 
NUMBER: 60/064249 

1997-11-03 
NUMBER: 60/065311 

1997-11-13 
NUMBER: 60/066364 

1997- 11-21 
NUMBER: 60/077450 

1998- 03-10 
NUMBER: 60/077632 

1998-03-11 
NUMBER: 60/077641 

1998-03-11 
NUMBER: 60/077649 

1998-03-11 
NUMBER: 60/077791 

1998-03-12 
NUMBER: 60/078004 

1998-03-13 
NUMBER: 60/078886 

1998-03-20 
NUMBER: 60/078936 

1998-03-20 
NUMBER: 60/078910 

1998-03-20 
NUMBER: 60/078939 

1998-03-20 
NUMBER: 60/079294 

1998-03-25 
NUMBER: 60/079656 

1998-03-26 
NUMBER: 60/079664 

1998-03-27 
NUMBER: 60/079689 

1998-03-27 
NUMBER: 60/079663 

1998-03-27 
NUMBER: 60/079728 

1998-03-27 
NUMBER: 60/079786 

1998-03-27 
NUMBER: 60/079920 

1998-03-30 
NUMBER: 60/079923 

1998-03-30 
NUMBER: 60/080105 

1998-03-31 
NUMBER: 60/080107 

1998-03-31 
NUMBER: 60/080165 

1998-03-31 
NUMBER: 60/080194 

1998-03-31 
NUMBER: 60/080327 
1998-04-01 



PRIOR APPLICATION NUMBER: 60/080328 
PRIOR FILING DATE: 1998-04-01 
PRIOR APPLICATION NUMBER: 60/080333 
PRIOR FILING DATE: 1998-04-01 
PRIOR APPLICATION NUMBER: 60/080334 
PRIOR FILING DATE: 1998-04-01 
PRIOR APPLICATION NUMBER: 60/081070 
PRIOR FILING DATE: 1998-04-08 
PRIOR APPLICATION NUMBER: 60/081049 
PRIOR FILING DATE: 1998-04-08 
PRIOR APPLICATION NUMBER: 60/081071 
PRIOR FILING DATE: 1998-04-08 
PRIOR APPLICATION NUMBER: 60/081195 
PRIOR FILING DATE: 1998-04-08 
PRIOR APPLICATION NUMBER: 60/081203 
PRIOR FILING DATE: 1998-04-09 
PRIOR APPLICATION NUMBER: 60/081229 
PRIOR FILING DATE: 1998-04-09 
PRIOR APPLICATION NUMBER: 60/081955 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081817 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081819 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081952 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081838 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/082568 
PRIOR FILING DATE: 1998-04-21 
PRIOR APPLICATION NUMBER: 60/082569 
PRIOR FILING DATE: 1998-04-21 
PRIOR APPLICATION NUMBER: 60/082704 
PRIOR FILING DATE: 1998-04-22 
PRIOR APPLICATION NUMBER: 60/082804 
PRIOR FILING DATE: 1998-04-22 
PRIOR APPLICATION NUMBER: 60/082700 
PRIOA FILING DATE: 1998-04-22 
PRIOR APPLICATION NUMBER: 60/082797 
PRIOR FILING DATE: 1998-04-22 
PRIOR APPLICATION NUMBER: 60/082796 
PRIOR FILING DATE: 1998-04-23 
PRIOR APPLICATION NUMBER: 60/083336 
PRIOR FILING DATE: 1998-04-27 
PRIOR APPLICATION NUMBER: 60/083322 
PRIOR FILING DATE: 1998-04-28 
PRIOR APPLICATION NUMBER: 60/083392 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/0834 95 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083496 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083499 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083545 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083554 



PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE : 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 



1998-04-29 
NUMBER: 60/083558 

1998-04-29 
NUMBER: 60/083559 

1998-04-29 
NUMBER: 60/083500 

1998-04-29 
NUMBER: 60/083742 

1998-04-30 
NUMBER: 60/084366 

1998-05-05 
NUMBER: 60/084414 

1998-05-06 
NUMBER: 60/084441 

1998-05-06 
NUMBER: 60/084637 

1998-05-07 
NUMBER: 60/084639 

1998-05-07 
NUMBER: 60/084640 

1998-05-07 
NUMBER: 60/084598 

1998-05-07 
NUMBER: 60/084600 

1998-5-07 
NUMBER: 60/084627 

1998-05-07 
NUMBER: 60/084643 

1998-05-07 
NUMBER: 60/085339 

1998-05-13 
NUMBER: 60/085338 

1998-05-13 
NUMBER: 60/085323 

1998-05-13 
NUMBER: 60/085582 

1998-05-15 
NUMBER: 60/085700 

1998-05-15 
NUMBER: 60/085689 

1998-05-15 
NUMBER: 60/085579 

1998-05-15 
NUMBER: 60/085580 

1998-05-15 
NUMBER: 60/085573 

1998-05-15 
NUMBER: 60/085704 

1998-05-15 
NUMBER: 60/085697 



Query Match 100.0%; Score 2687; DB 9; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.8e-240; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERH7VDGL 60 

I I I I I I I I I I I I I I M I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | 



Db 



1 MGAIARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 



QY 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I I I II I I I I I M I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | M I I I I 
Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I I I I I I I M II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | || 
Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I M I I I I I I I II I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | M | | | | | | | | | 
Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I || I I | 
Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I M | I I | | | | | | | | 
Db 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 42 0 

Qy 421 RAQ KRVGFAAS PCAE I AGAAVS EISGPFSTE DVAS NCVP AQ SLSEPILWI VS YALMS VC G 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I 

Db 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS N CVP AQ S L S E P I LW I VS YALMS VC G 480 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 4 81 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 



RESULT 11 

US-09-97 8-192A-196 

Sequence 196, Application US/09978192A 
Patent No. US20020177553A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Ashkenazi , Avi 
Baker Kevin P. 
Bots tein, David 
Desnoyers, Luc 
Eaton, Dan 
Ferrara, Napoleon 
Filvarof f , Ellen 
Fong, Sherman 
Gao, Wei-Qiang 
Gerber , Hanspeter 
Gerritsen, Mary E. 
Goddard, Audrey 
Godowski, Paul J. 
Grimaldi, J. Christopher 
Gurney, Austin L. 
Hillan, Kenneth J 
Kljavin, Ivar J. 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Kuo, Sophia S. 
Napier, Mary A. 
Pan, James; 
Paoni, Nicholas F. 
Roy, Margaret Ann 
Shelton, David L. 
Stewart, Timothy A. 
Tumas, Daniel 
Williams, P. Mickey 
Wood, William I . 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2630P1C9 

CURRENT APPLICATION NUMBER: US/09/978, 192A 
CURRENT FILING DATE: 2001-10-15 
PRIOR APPLICATION NUMBER: 09/918585 
PRIOR FILING DATE: 2001-07-30 
PRIOR APPLICATION NUMBER: 60/062250 
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; PRIOR APPLICATION NUMBER: 60/085579 

; PRIOR FILING DATE: 1998-05-15 

; PRIOR APPLICATION NUMBER: 60/085580 

PRIOR FILING DATE: 1998-05-15 

; PRIOR APPLICATION NUMBER: 60/085573 

; PRIOR FILING DATE: 1998-05-15 

; PRIOR APPLICATION NUMBER: 60/085704 

; PRIOR FILING DATE: 1998-05-15 

; PRIOR APPLICATION NUMBER: 60/085697 

Query Match 100.0%; Score 2687; DB 9; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.8e-240; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARAL L L P L LAQWL L RAAP E LAP AP FTL P L RVAAATN RWAPT PG P GT P AE RHAD GL 60 

I I I I I I I I M I I I I I I I I I I I I I M | I | | | | | | | M I I I I I I I ! I I I | | | | | | | | | | | | | 
Db 1 MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRVVAPTPGPGTPAERHADGL 60 

Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I | | | | | | | | | | | | | || | | | | | | | | 
Db 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

QY 121 TPHSYIDTYFDTERSSTYRSKGFDWVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I M I I I I I I I I II I I I I I I II I | | | | | | | | | | | | | | | | | | | | | || | | | | | | | | | | || | | 
Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I | M | | | | | 
Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I I I I I I I I I M I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I M I I I I I I I I I M I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 
Db 301 IVDSGTTLLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENS S RS FRI T I LPQLYI QPMMGAGLN YEC YRFGI S P STNALVI GATVMEGFYVT FD 420 

I I I I I M I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | || | | | | | | | | | | | | | | | | | | | | 
Db 361 YLRDENS SRS FRIT I LPQLYIQPMMGAGLNYECYRFGISP STNALVI GATVMEGFYVIFD 420 

Qy 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS NCVP AQ S L S E P I LWI VS YALMS VCG 480 

I I I I I M I I II I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 
Db 421 RAQKRVGFAAS PCAEI AGAAVS EI SGPFSTEDVASNCVPAQSLSE PI LWI VS YALMS VCG 480 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518- 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 



RESULT 12 

US-09-999-832A-196 

; Sequence 196, Application US/09999832A 

; Publication No. US20020192706A1 

; GENERAL INFORMATION: 

; APPLICANT: Ashkenazi, Avi 
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TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2630P1C63 

CURRENT APPLICATION NUMBER: US/09/999, 832A 
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PRIOR FILING DATE: 1998-04-01 
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PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 
PRIOR APPLICATION 
PRIOR FILING DATE: 



1998-04-15 
NUMBER: 60/081838 

1998-04-15 
NUMBER: 60/082568 

1998-04-21 
NUMBER: 60/082569 

1998-04-21 
NUMBER: 60/082704 

1998-04-22 
NUMBER: 60/082804 

1998-04-22 
NUMBER: 60/082700 

1998-04-22 
NUMBER: 60/082797 

1998-04-22 
NUMBER: 60/082796 

1998-04-23 
NUMBER: 60/083336 

1998-04-27 
NUMBER: 60/083322 

1998-04-28 
NUMBER: 60/083392 

1998-04-29 
NUMBER: 60/083495 

1998-04-29 
NUMBER: 60/083496 

1998-04-29 
NUMBER: 60/083499 

1998-04-29 
NUMBER: 60/083545 

1998-04-29 
NUMBER: 60/083554 

1998-04-29 
NUMBER: 60/083558 

1998-04-29 
NUMBER: 60/083559 

1998-04-29 
NUMBER: 60/083500 

1998-04-29 
NUMBER: 60/083742 

1998-04-30 
NUMBER: 60/084366 

1998-05-05 
NUMBER: 60/084414 

1998-05-06 
NUMBER: 60/084441 

1998-05-06 
NUMBER: 60/084637 

1998-05-07 
NUMBER: 60/084639 

1998-05-07 
NUMBER: 60/084640 

1998-05-07 
NUMBER: 60/084598 

1998-05-07 
NUMBER: 60/084600 
1998-05-07 



PRIOR APPLICATION NUMBER: 60/084627 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/084643 
PRIOR FILING DATE: 1998-05-07 
PRIOR APPLICATION NUMBER: 60/085339 
PRIOR FILING DATE: 1998-05-13 
PRIOR APPLICATION NUMBER: 60/085338 
PRIOR FILING DATE: 1998-05-13 
PRIOR APPLICATION NUMBER: 60/085323 
PRIOR FILING DATE: 1998-05-13 
PRIOR APPLICATION NUMBER: 60/085582 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085700 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085689 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085579 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085580 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085573 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085704 
PRIOR FILING DATE: 1998-05-15 
PRIOR APPLICATION NUMBER: 60/085697 

Query Match 100.0%; Score 2687; DB 9; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.8e-240; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MGALARAL LL P L LAQWL L RAAP ELAP AP FT L P L RVAAATN RWAP T P G P GT PAE RHAD GL 60 

Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M | | | | | | | | | | | | | | | I I I I 
Db 61 ALALEPALAS PAGAANFLAMVDNLQGDS GRGYYLEMLI GTP PQKLQI LVDTGS SN FAVAG 120 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDWVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I I I I I I I I I I 

Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I I I I I I I 

Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENS S RS FRITI LPQLYIQPMMGAGLNYECYRFGI S PSTNALVI GATVMEGFYVT FD 420 

I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 YLRDENS SRS FRITI LPQLYIQPMMGAGLNYECYRFGI S PSTNALVI GATVMEGFYVT FD 420 



Qy 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS NC VP AQ S L S E P I LW I VS YALMS VC G 480 

I I M I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

Db 421 RAQKRVGFAAS PCAEI AGAAVS EI SGPFSTEDVASNCVPAQSLSEP I LWIVS YALMS VCG 480 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I M I I I I I I I 
Db 481 AILLVLIVLLLLPFRCQRRPRDPEVWDESSLVRHRWK 518 

RESULT 13 
US-09-978-189-196 

; Sequence 196, Application US/09978189 
; Publication No. US20030004102A1 
; GENERAL INFORMATION: 



A DDT T C* AWT 


: Ashkenazi, Avi 


APPT.TPAMT 

■rt.tr JT J_i J. O^nlM X 


• DdKci IVcvin ir ■ 


TS. DDT TP7\ 


: Botstein, David 


ADDT T f~*7\ KTT" 

/\r c LiL UAJN 1 


: Desnoyers, Luc 


APPLICANT 


: Eaton, Dan 


APPLICANT 


: Ferrara, Napoleon 


APPLICANT 


: Filvaroff, Ellen 


APPLICANT 


: Fong, Sherman 


APPLICANT 


: Gao, Wei-Qiang 


APPLICANT 


: Gerber, Hanspeter 


APPLICANT 


: Gerritsen, Mary E. 


APPLICANT 


: Goddard, Audrey 


APPLICANT 


: Godowski, Paul J. 


APPLICANT: 


: Grimaldi, J. Christopher 


APPLICANT : 


: Gurney, Austin L. 


APPLICANT: 


: Hillan, Kenneth J 


APPLICANT: 


Kljavin, Ivar J. 


APPLICANT: 


Kuo, Sophia S. 


APPLICANT: 


Napier, Mary A. 


APPLICANT: 


Pan, James; 


APPLICANT: 


Paoni, Nicholas F. 


APPLICANT: 


Roy, Margaret Ann 


APPLICANT: 


Shelton, David L. 


APPLICANT: 


Stewart, Timothy A. 


APPLICANT: 


Tumas, Daniel 


APPLICANT: 


Williams, P. Mickey 


APPLICANT: 


Wood, William I . 



TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2630P1C7 

CURRENT APPLICATION NUMBER: US/09/978,189 
CURRENT FILING DATE: 2001-10-15 
PRIOR APPLICATION NUMBER: 09/918585 
PRIOR FILING DATE: 2001-07-30 
PRIOR APPLICATION NUMBER: 60/062250 
PRIOR FILING DATE: 1997-10-17 
PRIOR APPLICATION NUMBER: 60/064249 
PRIOR FILING DATE: 1997-11-03 
PRIOR APPLICATION NUMBER: 60/065311 
PRIOR FILING DATE: 1997-11-13 
PRIOR APPLICATION NUMBER: 60/066364 
PRIOR FILING DATE: 1997-11-21 



PRIOR APPLICATION NUMBER: 60/077450 
PRIOR FILING DATE: 1998-03-10 
PRIOR APPLICATION NUMBER: 60/077632 
PRIOR FILING DATE: 1998-03-11 
PRIOR APPLICATION NUMBER: 60/077641 
PRIOR FILING DATE: 1998-03-11 
PRIOR APPLICATION NUMBER: 60/077649 
PRIOR FILING DATE: 1998-03-11 
PRIOR APPLICATION NUMBER: 60/077791 
PRIOR FILING DATE: 1998-03-12 
PRIOR APPLICATION NUMBER: 60/078004 
PRIOR FILING DATE: 1998-03-13 
PRIOR APPLICATION NUMBER: 60/078886 
PRIOR FILING DATE: 1998-03-20 
PRIOR APPLICATION NUMBER: 60/078936 
PRIOR FILING DATE: 1998-03-20 
PRIOR APPLICATION NUMBER: 60/078910 
PRIOR FILING DATE: 1998-03-20 
PRIOR APPLICATION NUMBER: 60/078939 
PRIOR FILING DATE: 1998-03-20 
PRIOR APPLICATION NUMBER: 60/079294 
PRIOR FILING DATE: 1998-03-25 
PRIOR APPLICATION NUMBER:. 60/079656 
PRIOR FILING DATE: 1998-03-26 
PRIOR APPLICATION NUMBER: 60/079664 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079689 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079663 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079728 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079786 
PRIOR FILING DATE: 1998-03-27 
PRIOR APPLICATION NUMBER: 60/079920 
PRIOR FILING DATE: 1998-03-30 
PRIOR APPLICATION NUMBER: 60/079923 
PRIOR FILING DATE: 1998-03-30 
PRIOR APPLICATION NUMBER: 60/080105 
PRIOR FILING DATE: 1998-03-31 
PRIOR APPLICATION NUMBER: 60/080107 
PRIOR FILING DATE: 1998-03-31 
PRIOR APPLICATION NUMBER: 60/080165 
PRIOR FILING DATE: 1998-03-31 
PRIOR APPLICATION NUMBER: 60/080194 
PRIOR FILING DATE: 1998-03-31 
PRIOR APPLICATION NUMBER: 60/080327 
PRIOR FILING DATE: 1998-04-01 
PRIOR APPLICATION NUMBER: 60/080328 
PRIOR FILING DATE: 1998-04-01 
PRIOR APPLICATION NUMBER: 60/080333 
PRIOR FILING DATE: 1998-04-01 
PRIOR APPLICATION NUMBER: 60/080334 
PRIOR FILING DATE: 1998-04-01 
PRIOR APPLICATION NUMBER: 60/081070 
PRIOR FILING DATE: 1998-04-08 
PRIOR APPLICATION NUMBER: 60/081049 



PRIOR FILING DATE: 1998-04-08 
PRIOR APPLICATION NUMBER: 60/081071 
PRIOR FILING DATE: 1998-04-08 
PRIOR APPLICATION NUMBER: 60/081195 
PRIOR FILING DATE: 1998-04-08 
PRIOR APPLICATION NUMBER: 60/081203 
PRIOR FILING DATE: 1998-04-09 
PRIOR APPLICATION NUMBER: 60/081229 
PRIOR FILING DATE: 1998-04-09 
PRIOR APPLICATION NUMBER: 60/081955 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081817 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081819 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081952 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/081838 
PRIOR FILING DATE: 1998-04-15 
PRIOR APPLICATION NUMBER: 60/082568 
PRIOR FILING DATE: 1998-04-21 
PRIOR APPLICATION NUMBER: 60/082569 
PRIOR FILING DATE: 1998-04-21 
PRIOR APPLICATION NUMBER: 60/082704 
PRIOR FILING DATE: 1998-04-22 
PRIOR APPLICATION NUMBER: 60/082804 
PRIOR FILING DATE: 1998-04-22 
PRIOR APPLICATION NUMBER: 60/082700 
PRIOR FILING DATE: 1998-04-22 
PRIOR APPLICATION NUMBER: 60/082797 
PRIOR FILING DATE: 1998-04-22 
PRIOR APPLICATION NUMBER: 60/082796 
PRIOR FILING DATE: 1998-04-23 
PRIOR APPLICATION NUMBER: 60/083336 
PRIOR FILING DATE: 1998-04-27 
PRIOR APPLICATION NUMBER: 60/083322 
PRIOR FILING DATE: 1998-04-28 
PRIOR APPLICATION NUMBER: 60/083392 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083495 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083496 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083499 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083545 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083554 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083558 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083559 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083500 
PRIOR FILING DATE: 1998-04-29 
PRIOR APPLICATION NUMBER: 60/083742 
PRIOR FILING DATE: 1998-04-30 



PRIOR APPLICATION NUMBER: 60/084366 
; PRIOR FILING DATE: 1998-05-05 

PRIOR APPLICATION NUMBER: 60/084414 

PRIOR FILING DATE: 1998-05-06 

PRIOR APPLICATION NUMBER: 60/084441 

PRIOR FILING DATE: 1998-05-06 
; PRIOR APPLICATION NUMBER: 60/084637 
; PRIOR FILING DATE: 1998-05-07 

PRIOR APPLICATION NUMBER: 60/084639 
; PRIOR FILING DATE: 1998-05-07 

PRIOR APPLICATION NUMBER: 60/084640 
; PRIOR FILING DATE: 1998-05-07 

PRIOR APPLICATION NUMBER: 60/084598 

PRIOR FILING DATE: 1998-05-07 

PRIOR APPLICATION NUMBER: 60/084600 
; PRIOR FILING DATE: 1998-5-07 
; PRIOR APPLICATION NUMBER: 60/084627 
; PRIOR FILING DATE: 1998-05-07 

PRIOR APPLICATION NUMBER: 60/084643 

PRIOR FILING DATE: 1998-05-07 
; PRIOR APPLICATION NUMBER: 60/085339 
; PRIOR FILING DATE: 1998-05-13 
; PRIOR APPLICATION NUMBER: 60/085338 

PRIOR FILING DATE: 1998-05-13 

PRIOR APPLICATION NUMBER: 60/085323 

PRIOR FILING DATE: 1998-05-13 
; PRIOR APPLICATION NUMBER: 60/085582 
; PRIOR FILING DATE: 1998-05-15 

PRIOR APPLICATION NUMBER: 60/085700 
; PRIOR FILING DATE: 1998-05-15 
; PRIOR APPLICATION NUMBER: 60/085689 

PRIOR FILING DATE: 1998-05-15 

PRIOR APPLICATION NUMBER: 60/085579 
; PRIOR FILING DATE: 1998-05-15 

PRIOR APPLICATION NUMBER: 60/085580 
; PRIOR FILING DATE: 1998-05-15 

PRIOR APPLICATION NUMBER: 60/085573 
; PRIOR FILING DATE: 1998-05-15 
; PRIOR APPLICATION NUMBER: 60/085704 
; PRIOR FILING DATE: 1998-05-15 

PRIOR APPLICATION NUMBER: 60/085697 

Query Match 100.0%; Score 2687; DB 10; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.8e-240; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERH7U)GL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MGAIARALLLPLIAQWLLRAAPEIAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADG^ 60 

Qy 61 ALALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 TPHSYIDTY-FDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 



Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II 

Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENS S RS FRI TI LPQLYIQPMMGAGLNYECYRFGI S PSTNALVI GATVMEGFYVI FD 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 YLRDENS SRS FRI TI LPQLYIQPMMGAGLNYECYRFGI S PSTNALVI GATVMEGFYVI FD 420 

Qy 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFST EDVAS NCVP AQ S L S E P I LW I VS YALMS VC G 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 RAQ KRVG FAAS PCAE I AGAAVS E I S G P F S T EDVAS N CVP AQ S L S E P I LW I VS YALMS VC G 480 

Qy 4 81 AI LLVLI VLLLLP FRCQRRPRDPEWNDES SLVRHRWK 518 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 481 AILLVLIVLLLLP FRCQRRPRDPEWNDES SLVRHRWK 518 

RESULT 14 

US-09-978-608A-196 

; Sequence 196, Application US/09978608A 
; Publication No. US20030045462A1 
; GENERAL INFORMATION: 



APPLICANT: 


: Ashkenazi, Avi 


APPLICANT: 


: Baker Kevin P. 


APPLICANT: 


: Botstein, David 


APPLICANT: 


: Desnoyers, Luc 


APPLICANT: 


: Eaton, Dan 


APPLICANT: 


: Ferrara, Napoleon 


APPLICANT: 


: Filvaroff, Ellen 


APPLICANT: 


: Fong, Sherman 


APPLICANT: 


: Gao, Wei-Qiang 


APPLICANT: 


: Gerber, Hanspeter 


APPLICANT : 


: Gerritsen, Mary E. 


APPLICANT: 


: Goddard, Audrey 


APPLICANT: 


: Godowski, Paul J. 


APPLICANT: 


: Grimaldi, J. Christopher 


APPLICANT: 


: Gurney, Austin L. 


APPLICANT: 


: Hillan, Kenneth J 


APPLICANT: 


: Kljavin, Ivar J. 


APPLICANT: 


: Kuo, Sophia S. 


APPLICANT: 


: Napier, Mary A. 


APPLICANT: 


: Pan, James; 


APPLICANT: 


: Paoni, Nicholas F. 


APPLICANT: 


: Roy, Margaret Ann 


APPLICANT: 


Shelton, David L. 


APPLICANT: 


Stewart, Timothy A. 


APPLICANT: 


Tumas, Daniel 


APPLICANT: 


Williams, P. Mickey 



; APPLICANT: Wood, William I. 

; TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
; TITLE OF INVENTION: Acids Encoding the Same 
; FILE REFERENCE: P2630P1C22 

; CURRENT APPLICATION NUMBER: US/09/978, 608A 
; CURRENT FILING DATE: 2 001-10-16 
; NUMBER OF SEQ ID NOS : 624 

; Prior Application removed - See File Wrapper or Palm 
; SEQ ID NO 196 
; LENGTH: 518 

TYPE: PRT 
; ORGANISM: Homo sapien 
US-09-978-608A-196 

Query Match 100.0%; Score 2687; DB 10; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.8e-240; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARALLLPLLAQWLLRAAPEIJVPAPFTLPLRVAAATNR 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I M I I I 
Db 1 MGALARALLLPLLAQWLLI^PEIJVPAPFTLPLRVAAATO 60 

Qy 61 ALALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

I I M I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I II I I I I I 

Db 61 ALAL E P ALAS P AGAAN FLAMVDN LQ G D S GRG Y Y L EML IGTPPQKLQI L VDT G S S N FAVAG 120 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | 
Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FES EN F FL P G I KWNGI L GLAYAT LAK PSSSLETFFDS LVT QAN I PN VF SMQMC GAGL P VA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 FESENFFLPGI KWNGI LGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

Qy 421 RAQKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPILWIVSYALMSVCG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | | | | | | | | | | 
Db 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS NCVP AQ S L S E P I LW I VS YALMS VC G 480 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 



RESULT 15 

US-09-978-585A-196 



Sequence 196, Application US/09978585A 
Publication No. US20030049633A1 
GENERAL INFORMATION : 



APPLICANT 


: Ashkenazi* Avi 


APPLICANT 


: Baker Kevin P. 


APPLICANT 


: Botstein. David 


APPLICANT 


: Desnovers. Luc 


APPLICANT 

J XX. X» X^ xx ~ ^ 


: Eaton Dan 


APPLICANT 

X X A*. X* 1 J *X» *W*X XX V .X 


: Ferrara. Nanol eon 


APPLICANT 

X XX- J— J_l _L, V^X XX t _l_ 


; Fi 1 \rR rof f TCl 1 pn 

* -L» — L_ V U X* X. X. ^ X— J L \w- X X 


APPLICANT: 

X X 1— 1— 1 1 1 V—'X XX ™ X. 4 


: Fona Sherman 

► J— w X X ^—4 » X X V-- J X L Ld-X X X 


APPLICANT: 

X XX. JL X_l -L_ L1 1 JL < 


: Gao . Wei — Oi ancr 


APPLICANT ' 


> vj _l_ v — J_ , llullOUvUv^X 


APPLICANT ' 


> r ^ J_ J l_ O »3 11 ^ 1 Id X, y X-J • 


APPLICANT ■ 

X LJ_ 1- J J _1_ V^x XX i _L ■ 


1 CrodHa rd AuHrpv 


APPLICANT « 


* f^oH ow^ k i Paul .T 


APPLICANT * 

iiL J_ t I _i_ villi X 4 


' Grinrt^lH") iT Ph ri £ ^"hriTiVif : ' r 


APPLICANT * 


frii rnpu Aii c ?'h"in Ti 

V f r\UO UX11 XJ * 


APPLICANT ■ 


W "i 1 1 pi ri PCf^rm p*t" Vi ,T 

ilX J LCLll^ IVCllilC L-XX KJ 




x\_L J d VlU, ±VclI. O • 


APPLICANT: 


Kuo, Sophia S. 


APPLICANT: 


Napier, Mary A. 


APPLICANT: 


Pan, James; 


APPLICANT: 


Paoni, Nicholas F. 


APPLICANT: 


Roy, Margaret Ann 


APPLICANT: 


Shelton, David L. 


APPLICANT: 


Stewart, Timothy A. 


APPLICANT: 


Tumas, Daniel 


APPLICANT: 


Williams, P. Mickey 


APPLICANT: 


Wood, William I. 



TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
; TITLE OF INVENTION: Acids Encoding the Same 
; FILE REFERENCE: P2630P1C15 

; CURRENT APPLICATION NUMBER: US/09/978, 585A 
; CURRENT FILING DATE: 2001-10-16 
; NUMBER OF SEQ ID NOS : 624 

; Prior Application removed - See File Wrapper or Palm 
; SEQ ID NO 196 

LENGTH: 518 

TYPE: PRT 

ORGANISM: Homo sapien 
US-09-978-585A-196 

Query Match 100.0%; Score 2687; DB 10; Length 518; 

Best Local Similarity 100.0%; Pred. No. 2.8e-240; 

Matches 518; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARAL L L P L LAQWL L RAAP ELAP AP FT L P LRVAAATN RWAP T P G P GT P AE RHAD GL 60 

I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I II I I I I I I I I I I II I I I I 

Db 1 MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHM 60 

Qy 61 ALALE PALAS P AGAAN FLAMVDNLQGDS GRG Y YLEML I GT P PQKLQ I L VDT GS SN FAVAG 120 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 ALALEPAIjASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 12 0 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I 



Db 



121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 



Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 

Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I 

Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENS SRS FRITI LPQLYIQPMMGAGLNYEC YRFGI S PSTNALVI GATVMEGFYVI FD 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I II I I I I I I I I I I I I I I I I I I I 

Db 361 YLRDENS S RS FRI TI LPQLYI QPMMGAGLNYEC YRFGI S P STNALVI GATVMEGFYVI FD 420 

Qy 421 RAQKRVGFAASPCAEIAGAAVSEISGPFSTEDVASNCVPAQSLSEPILWIVSYALMSVCG 480 

I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 421 RAQ KRVG FAAS P C AE I AGAAVS E I S G P F S T EDVAS N C VP AQ S L S E P I LWI VS YALM SVC G 480 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 



Search completed: February 28, 2004, 09:20:27 
Job time : 36.6364 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: February 28, 2004, 09:05:50 ; Search time 18.5 Seconds 

(without alignments) 
2693.363 Million cell updates/sec 

Title: US-09-668-314C-2 
Perfect score: 2687 

Sequence: 1 MGALARALLLPLLAQWLLRA RPRDPEWNDESSLVRHRWK 518 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 283366 seqs, 96191526 residues 

Total number of hits satisfying chosen parameters: 283366 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : 



PIR_78: * 
1: pirl:* 
2: pir2:* 
3: pir3:* 
4: pir4:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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RESULT 1 
A59090 

aspartic proteinase (EC 3.4.23.-) BACE precursor - human 

N; Alternate names: beta-secretase; beta-site APP cleaving enzyme 

C; Species: Homo sapiens (man) 

C;Date: 29-Oct-1999 #sequence_revision 29-Oct-1999 #text_change ll-May-2000 
C;Accession: A59090 

R;Vassar, R. ; Bennett, B.D.; Babu-Khan, S.; Kahn, S.; Mendiaz, E.A. ; Denis, P.; 
Teplow, D.B.; Ross, S.; Amarante, P.; Loeloff, R. ; Luo, Y. ; Fisher, S.; Fuller, 
J.; Edenson, S.; Lile, J. ; Jarosinski, M.A. ; Biere, A.L.; Curran, E. ; Burgess, 
T.; Louis, J.C.; Collins, F. ; Treanor, J. ; Rogers, G.; Citron, M. 
Science 286, 735-741, 1999 

A;Title: beta-Secretase cleavage of Alzheimer's amyloid precursor protein by the 
transmembrane aspartic protease BACE. 

A; Reference number: A59090; MUID : 20002972 ; PMID : 10531052 
A; Note: submitted to GenBank, September 1999 
A; Accession: A59090 

A; Status: not compared with conceptual translation 
A; Molecule type: mRNA 
A; Residues: 1-501 <VAS> 



A;Cross-references: GB:AF190725; NID: g6118538 ; PIDN: AAF04142 . 1; PID:g6118539 
C ; Genetics : 
A; Gene: BACE 

C; Superf amily : beta-secretase 

C; Keywords: Alzheimer's disease; aspartic proteinase; brain; glycoproteins- 
hydrolase; protein digestion; transmembrane protein; zymogen 
F; 1-21/Domain : signal sequence ftstatus predicted <SIG> 
F; 22-45/Domain: propeptide #status predicted <PRO> 
F;46-501/Product: acid proteinase BACE (fstatus predicted <MAT> 
F; 4 61-4 7 7 /Domain: transmembrane #status predicted <TRN> 
F; 93, 289/Active site: Asp ffstatus predicted 

F; 153, 172,223, 354 /Binding site: carbohydrate (Asn) (covalent) ftstatus predicted 
F; 330-380/Disulfide bonds: #status predicted 

Query Match 43.9%; Score 1178.5; DB 2; Length 501; 

Best Local Similarity 46.2%; Pred. No. 3.8e-80; 

Matches 240; Conservative 82; Mismatches 164; Indels 33; Gaps 9; 



Qy 


7 


ALLLPLLAQWLLRAAPELAPAPFT L P L RVAAATN RWAPT P G P GT P AE RHAD GLA 

| M I 1 : : 1 1 1 1 1 1 1 l> 'I 
AQALPWLLLWM GAGVLPAHGTQHGIRLPLRSGLG GAPL GLR 


61 


Db 


2 


42 


Qy 


62 


LALE — PALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVA 

| | | : | : 1 1 M 1 : 1 1 1 : 1 1 1 : 1 1 : 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

LPRETDEEPEEPGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVG 


119 


Db 


43 


102 


Qy 
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GTPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIAT 

ii i. .11111 i i i i l l l l * 1 1 1 1 * 1 1 1 1 * Ml 
| | : : | : : 1 1 II 1 1 1 II 1 1 1 I * 1 1 1 1 ■ 1 l l l * i ' ' 

TUVPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRT^IAA 


179 


Db 


103 


162 


Qy 


180 


IFESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPV 

| ||: ||: | I MINIM : 1 : 1 Ml 1 1 1 1 1 1 1 : : 1 1 : M : 1 : 1 1 1 1 1 : 

ITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHVPNLFSLQLCGAGFPL 


239 


Db 


163 


222 


Qy 
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AGS GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYN 

| : | M : : : | | | : III 1 : II 1 1 1 : II II : : 1 : : : 1 1 II 1 : M : M 1 
NQS EVLAS VGGSMI I GGI DHS L YTGS LW YT PI RREWYYEVI I VRVEINGQDLKMDCKEYN 


296 


Db 


223 


282 


Qy 


297 


ADKAIVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFP 

M : 1 1 II 1 II 1 1 M : 1 M : 1 1 : : : II : 1 1 1 1 1 1 II 1 1 1 1 1 : 1 1 

YDKS I VDSGTTNLRLPKKVFEAAVKS I KAAS STEKFPDGFWLGEQLVCWQAGTTPWNI FP 


356 


Db 


283 


342 


Qy 


357 


KI S I YLRDENS S RS FRIT I LPQLYI QPMMGAGLN Y- ECYRFGI S P STNALVI GATVMEGF 

M : 1 1 1 : : : 1 1 1 1 1 1 1 1 1 1 : : 1 : : M 1 M 1 1 1 : 1 : M : 1 1 1 1 

VI SLYLMGEVTNQS FRITI LPQQYLRPVEDVATSQDDCYKFAI SQS STGTVMGAVIMEGF 


415 


Db 


343 


402 


Qy 


416 


WIFDRAQKRVGFAASPCAEIAG7\AVSEISGPFSTEDVASNCVPAQSLSEPILWIVSYAL 

1 1 : 1 1 1 1 : 1 1 : 1 1 1 1 1 : : 1 1 1 1 1 : 1 1 : : 1 : 

YWFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTLDMEDCGYNIPQTDESTLMTIAYVM 


475 


Db 


403 


462 


Qy 


476 


MSVCGAILLVLIVLLLLPFRCQR — RPRDPEWNDESSL 512 




Db 


463 


::| |: :: : |:: Mil 1 : : Mil 

AAIC-ALFMLPLCLMVCQWRCLRCLRQQHDDFADDISLL 500 





RESULT 
JC7573 



2 



pepsinogen C - African clawed frog 

N;Alternate names: progastricsin 

C; Species: Xenopus laevis (African clawed frog) 

C;Date: 30-Jun-2001 #sequence_revision 30-Jun-2001 #text_change 03-Aug-2001 
C;Accession: JC7573; PC7118 

R;Ikuzawa, M. ; Inokuchi, T.; Kobayashi, K. ; Yasumasu, S. 
J. Biochem. 129, 147-153, 2001 

A; Title: Amphibian pepsinogens: Purification and characterization of Xenopus 

pepsinogens, and molecular cloning of Xenopus and bullfrog pepsinogens. 

A;Reference number: JC7573; MUID: 21064922; PMID : 11134969 

A; Contents: Stomach 

A;Accession: JC7573 

A;Molecule type: mRNA 

A; Residues: 1-383 <IKU> 

A;Cross-references : DDBJ: AB045379 

A; Accession: PC7118 

A;Molecule type: protein 

A; Residues: 17-68 <IK2> 

C; Comment: This protein is a zymogen for gastric aspartic proteinase, with 

pepsin-like activity. 

C; Genetics: 

A; Gene : PgC 

C; Superf amily : pepsin 

C; Keywords: stomach; zymogen 

Query Match 13.7%; Score 367.5; DB 2; Length 383; 

Best Local Similarity 28.9%; Pred. No. 9.1e-20; 

Matches 132; Conservative 70; Mismatches 154; Indels 101; Gaps 25; 

MGAIARALLLPLIAQWLLRAAPEIAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

| I I I : I : : : : I I =1 I : I I II = • = 

MTfTTT.TT.aT.vrT.nT.ciF.fiTTR VPT.KKFKSMREVMRENGI KAPLVDPAT KYYNOY 52 



| | | | : I : : I I I I : I I I I I I : I I I I I II II 



Qy 


1 


Db 


1 


Qy 


61 


Db 


53 


Qy 


121 


Db 


96 


Qy 


174 


Db 


146 


Qy 


225 


Db 


199 


Qy 


283 


Db 


253 


Qy 


343 


Db 


302 



I : : I I I I I : : : I I I I I : I I 



| : | : | I II : : : I M I I II : : I : : I : : I I : 



225 PN--VFSMQMCGAGLPVAGSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLE 282 

| : | : | I III: | | : : : I I I : : I I : I I : II I 

199 LNQPIFGFYLSGQ SSQNGGEVAFGGVDQNYYTGQIYWTPVTSETYWQIGIQGFS 252 



III: I : : I I II : II : I I I I II 



343 ACWTNSETPWSYFPKISIYLRDENSSRSFRITILPQLYI-QPMMGAGLNYECYRFGIS— 399 

: | : III: III: I I : I I I I I 

302 VSCSNIQN LPTISFTI SGVSFPLP — PSAYVLQQSSG YC-TIGIMPT 345 



Qy 400 — PSTNA LVT GATVMEGFYVI FDRAQKRVGFAAS 431 

Ml : : I : : I : : I : I I I I : 

Db 34 6 YLPSQNGQPLWTLGDVFLREYYSVYDLGNNQVGFATA 382 



RESULT 3 
PEMQCJ 

gastricsin (EC 3.4.23.3) precursor - Japanese macaque (fragment) 
N;Alternate names: pepsin C 

C;Species: Macaca fuscata (Japanese macaque) 

C;Date: 13-Aug-1986 #sequence_revision 19-Oct-1995 #text_change 18-Jun-1999 
C;Accession: S19683; A00986; A22402; S16066 
R;Kageyama, T.; Tanabe, K. ; Koiwai, O. 
Eur. J. Biochem. 202, 205-215, 1991 

A; Title: Development-dependent expression of isozymogens of monkey pepsinogens 
and structural differences between them. 

A; Reference number: S19681; MUID : 92037 645 ; PMID: 1935977 
A;Accession: S19683 
A;Molecule type: mRNA 
A; Residues: 1-377 <KAG> 

A; Cross-references: EMBL:X59754; NID:g38072; PIDN : CAA42426 . 1 ; PID:g38073 

R;Kageyama, T.; Takahashi, K. 

J. Biol. Chem. 261, 4406-4419, 1986 

A; Title: The complete amino acid sequence of monkey progastricsin . 
A; Reference number: A00986; MUID : 86168133; PMID: 3514597 
A/Accession: A00986 
A;Molecule type: protein 

A;Residues: 6-330, 'V , 332-349, 'VY ' , 350-377 <KA2> 
R;Kageyama, T.; Takahashi, K. 
J. Biochem. 97, 1235-1246, 1985 

A; Title: Monkey pepsinogens and pepsins. VII. Analysis of the activation process 

and determination of the NH2-terminal 60-residue sequence of Japanese monkey 

progastricsin, and molecular evolution of pepsinogens. 

A;Reference number: A22402; MUID: 85289106; PMID:3928607 

A;Accession: A22402 

A;Molecule type: protein 

A; Residues: 6-65 <KA3> 

C; Comment: This enzyme has more restricted specificity than pepsin A. 

C; Comment: The enzyme is activated in a two-step process that gives rise to two 

end products. The shorter, Ser-gastricsin, is the major product. 

C; Superf amily : pepsin 

C; Keywords: aspartic proteinase; gastric juice; hydrolase; protein digestion; 
stomach 

F; 1-5/Domain: signal sequence (fragment) ffstatus predicted <SIG> 
F; 6-377/Product : progastricsin #status experimental <ZYM> 
F; 6-4 5/ Domain : activation peptide #status experimental <APT> 
F;46-377/Product : Gly-gastricsin #status experimental <MIN> 
F;49-377/Product : Ser-gastricsin ftstatus experimental <MAT> 
F;31-32/Cleavage site: Phe-Leu (pepsin) ftstatus experimental 
F;45-46/Cleavage site: Phe-Gly (pepsin) ftstatus experimental 
F;48-49/Cleavage site: Leu-Ser (pepsin) ftstatus experimental 
F; 80, 265/Active site: Asp ftstatus predicted 

F;9 3_98^ 256-260, 299-332/Disulf ide bonds: ftstatus experimental 



Query Match 13.5%; Score 363.5; DB 1; 

Best Local Similarity 28.9%; Pred. No. 1.8e-19; 
Matches 118; Conservative 65; Mismatches 118; 



Length 377; 

Indels 107; Gaps 19; 



Qy 56 HADGLALALEPALAS PAGAANFLAMVT)NLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSN 115 

| | : : : I I : I : I I : I : I I I I I I : I I I I I I I 

Db 44 HFGDLSVSYEP MAYMD AAYFGEISIGTPPQNFLVLFDTGSSN 85 

Qy H6 FAV AGTPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPK 167 

| I I I I I : I I I I : I : : : I II I I I I I : I : 

Db 86 LWVPSVYCQSQACTSHS RFNPSESSTYSTNGQTFSLQYGSGSLTGFFGYDTLTV-- 139 

Qy 168 GFNTSFLVNIATIFESENFFLPG IKWNGILGLAYATLAKPSSSLETFFDSLVTQA 222 

| | Ml II : : : I I : I I I I I I : : : I : I : 

Db 14 0 QSIQVPNQEFGLSEN— EPGTNFVYAQFDGIMGLAYPTLSVDGAT — TAMQGMVQEG 192 

Qy 223 NI PN- VFSMQMCGAGLPVAGS GTNGGS LVLGGI E P S LYKGDI WYT P I KEEW YYQI EI LKL 281 

: : : I I : : I : : I I : : I I I : : I I I I I : : I : : I I : I I I : 

Db 193 ALTSPIFSVYLSDQ QGSSGGAWFGGVDSSLYTGQI YWAPVTQELYWQIGIEEF 246 

Qy 282 EIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQ 341 

MM: || : | II I : M : II : I I : I : : : I I I : I 

Db 247 LI GGQAS GW- CS E — GCQAI VDTGT S LLTVPQQYMSALLQA TGAQ 288 

Qy 342 LACWTNSETPWSYF PKISI YLRDENSSRSFRITILPQLYIQPMMGAGLNY 391 

| : | I : : : : I II II 
Db 289 EDEYGQFLVNCNSIQNLPTLTFII NGVEFPLPPSSYI LNN 328 

Qy 392 ECY-RFGISP STNALVI GATVMEGFYVI FDRAQKRVGFAAS 431 

| |:| | : : I : : I : : I : Mill: 

Db 329 NGYCTVGVEPTYLSAQNSQPLWILGDVFLRSYYSVYDLSNNRVGFATA 37 6 



RESULT 4 
A39314 

gastricsin (EC 3.4.23.3) precursor - bullfrog 
C; Species: Rana catesbeiana (bullfrog) 

C;Date: 19-Jun-1992 #sequence_revision 19-Jun-1992 #text_change 22-Jun-1999 
C;Accession: A39314 

R;Yakabe, E. ; Tanji, M. ; Ichinose, M. ; Goto, S.; Miki, K. ; Kurokawa, K. ; Ito, 

H-; Kageyama, T . ; Takahashi, K. 

J. Biol. Chem. 266, 22436-22443, 1991 

A; Title: Purification, characterization, and amino acid sequences of pepsinogens 

and pepsins from the esophageal mucosa of bullfrog (Rana catesbeiana) . 

A; Reference number: A39314; MUID : 92042186; PMID: 1939266 

A;Accession: A39314 

A; Status: preliminary 

A;Molecule type: mRNA 

A; Residues: 1-384 <YAK> 

A;Cross-references: GB:M73750; NID:g213687; PIDN : AAA49530 . 1 ; PID:g213688 
C; Super family : pepsin 

C;Keywords: aspartic proteinase; hydrolase; protein digestion 

Query Match 13.2%; Score 355.5; DB 2; Length 384; 

Best Local Similarity 26.5%; Pred. No. 7.2e-19; 

Matches 120; Conservative 73; Mismatches 136; Indels 123; Gaps 21; 

Qy 23 E LAP AP FT L P L RVAAATN RW APTPGPGT PAERHADGLALALEPALAS PAGAAN 7 6 

: | : : M : : I : I I I I :: : I I I I II 

Db 12 QLSEGIIKVPLKKFKSMREVMRDHGIKAPWDPAT K YYNN FATAFE P LAN 61 



Qy 77 FLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDT Y 129 

: : | 111:111111 : I I I I I I I I I • I : 

Db 62 YMDM SYYGEISIGTPPQNFLVLFDTGSSNLWV PSTYCQSQACTNHPQ 108 

Qy 130 FDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENFFL- 188 

I : : I I : I I : : : I I I I I : I I I I I I I : I I 

Db 109 FNPSQSSSYSSNQQQFSLQYGTGSLTGILGYDTVQIQ NIA— ISQQEFGLS 157 

Qy 189 PG IKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPN — VFSMQMCGAGLP 238 

|| : : : I I I I I I I : : I : : : I : : I I : I : I : : I 

Db 158 VTEPGTNFVYAQFDGILGLAYPSIAEGGAT — TVMQGMI-QQNLINQPLFAFYLSG 210 

Qy 239 VAGSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNAD 298 

: |||: I I : : : I I I : : I I : I I : I I I : I I : I • 

Db 211 -QQNSQNGGEVAFGGVDQNYYSGQIYWTPVTSETYWQIGIQGFSVNGQATGW-CSQ--GC 2 66 

Qy 299 KAIVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQ LACWTNSET 350 

: I I I : I I : I I II II :::::: I : I : : I 

Db 267 QGIVDTGTSLLTAPQSVFSSLMQSI GAQQDQNGQYAVSCSNIQSL 311 

Qy 351 PWSYFP KISIYLRDENS S RS FRI T I LPQLYI QPMMGAGLN YECYRFGI S 399 

| | | | : : | I : III II : 
Db 312 PTISFTISGVSFPLPPSAYVLQQNSGYCTIGIMPTYLPSQNGQPLW 357 

Qy 400 PSTNALVI GATVMEGFYVI FDRAQKRVGFAAS 431 

: : I : : I : : I : I I I I I : 

Db 358 1 LGDVFLRQYYSVYDLGNNQVGFAAA 383 



RESULT 5 
JE0371 

pepsin C (EC 3.4.23.-) precursor - chicken 
N; Alternate names: pepsinogen C 
C; Species: Gallus gallus (chicken) 

C;Date: 23-Jul-1999 #sequence_revision 23-Jul-1999 #text_change ll-May-2 000 
C;Accession: JE0371 

R;Sakamoto, N . ; Saiga, H.; Yasugi, S. 

Biochem. Biophys . Res. Commun. 250, 420-424, 1998 

A; Title: Analysis of temporal expression pattern and cis-regulatory sequences of 
chicken pepsinogen A and C. 

A; Reference number: JE0370; MUID : 98440813 ; PMID: 9753645 

A;Accession: JE0371 

A; Status : preliminary 

A; Molecule type: mRNA 

A; Residues: 1-389 <SAK> 

C; Super f amily : pepsin 

C; Keywords: aspartic proteinase; hydrolase 

Query Match 13.2%; Score 355; DB 2; Length 389; 

Best Local Similarity 28.7%; Pred. No. 7.9e-19; 

Matches 114; Conservative 58; Mismatches 121; Indels 104; Gaps 16; 

Qy 75 ANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGT PHSYI 12 6 

: | | : | : I I I : I I I I I I : I I I I I I I I I I : 

Db 56 SNFATAYEPLANNMDMSYYGEISIGTPPQNFLVLFDTGSSNLWVPSTLCQSQACANHN 113 



Qy 


127 


Db 


114 


Qy 


173 


Db 


171 


Qy 


231 


Db 


213 


Qy 


291 


Db 


267 


Qy 


351 


Db 


315 


Qy 


395 


Db 


363 



FDTERSSTYRSKGFDWVKYTQGSWTGFVGEDLVTIPKGFN TS 172 

|| III::: : : : I I I I I I I I I : I : I I 



||: : : | M | | I : : : I : I : | I : : I I I 



I : I I I I I I : : I : I I I I : II : : hill : I I I I 



291 DCREYNADKAIVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSET 350 

I : : I I I : I I : I I : I : I I : : : : : I : I : I : I I 

267 -CSQ — GCQGIVDTGTSLLTVPNQVFTELMQYIG AQADD SGQYVASCSNIE- 314 



III | I : I I : III II: 

. — YMPTITFVISGTSFPLPPSAYMLQSNSDYCTVGIESTYLPSQTGQPLW 362 

395 RFGI S PSTNALVIGATVMEGFYVT FDRAQKRVGFAAS 431 

: : I : : I I : I : I I I I : 

363 1 LGDVFLRVYYS I YDMGNNQVGFATA 388 



RESULT 6 
A29937 

gastricsin (EC 3.4.23.3) precursor - human 
N; Alternate names: pepsin C; pepsinogen C 
C; Species: Homo sapiens (man) 

C;Date: 17-Oct~1988 #sequence_revision 17-Oct-1988 #text_change 31-Mar-2000 
C;Accession: A29937; A31811; PX0028; 154213; A91125; A23458 
R;Hayano, T . ; Sogawa, K. ; Ichihara, Y.; Fuj ii-Kuriyama, Y. ; Takahashi, K. 
J. Biol. Chem. 263, 1382-1385, 1988 

A; Title: Primary structure of human pepsinogen C gene. 
A;Reference number: A29937; MUID : 88087276; PMID:3335549 
A;Accession: A29937 
A; Molecule type: DNA 
A; Residues: 1-388 <HAY> 

R;Taggart, R.T.; Cass, L.G.; Mohandas, T.K.; Derby, P.; Barr, P.J.; Pals, G.; 
Bell, G.I. 

J. Biol. Chem. 264, 375-379, 1989 

A; Title: Human pepsinogen C (progas tricsin) . Isolation of cDNA clones, 
localization to chromosome 6, and sequence homology with pepsinogen A. 
A;Reference number: A31811; MUID : 89079679 ; PMID:2909526 
A;Accession: A31811 
A;Molecule type: mRNA 
A; Residues: 1-388 <TAG> 

A; Cross-references: GB:J04443; NID:g551175; PIDN: AAA60074 . 1 ; PID:g551176 
R;Athauda, S.B.P.; Tanji, M. ; Kageyama, T.; Takahashi, K. 
J. Biochem. 106, 920-927, 1989 

A; Title: A comparative study on the NH2-terminal amino acid sequences and some 

other properties of six isozymic forms of human pepsinogens and pepsins. 

A; Reference number: PX0023; MUID : 90130402 ; PMID:2515193 

A; Accession: PX0028 

A; Molecule type: protein 

A; Residues: 17-101 <ATH> 



R;Pals, G.; Azuma, T.; Mohandas, T.K.; Bell, G.I.; Bacon, J.; Samloff, I.M. ; 
Walz, D.A. ; Barr, P.J.; Taggart, R.T. 
Genomics 4, 137-148, 1989 

A;Title: Human pepsinogen C (progastricsin) polymorphism: evidence for a single 
locus located at 6p21.1-pter. 

A;Reference number: 154213; MUID : 89290840 ; PMID:2567697 
A;Accession: 154213 

A; Status: translated, from GB/EMBL/DDB J 
A; Molecule type: DNA 
A; Residues: 1-388 <RES> 

A; Cross-references : GB:M23077; NID:gl89830; PIDN :AAA60063 . 1 ; PID:g387015; 
GB : J03063 

A; Note: parts of this sequence, including the amino end and carboxyl ends of the 

mature protein, were determined by protein sequencing 

R;Foltmann, B.; Jensen, A.L. 

Eur. J. Biochem. 128, 63-70, 1982 

A;Title: Human progastricsin. Analysis of intermediates during activation into 

gastricsin and determination of the amino acid sequence of the propart. 

A;Reference number: A91125; MUID : 83079318 ; PMID: 6816595 

A;Accession: A91125 

A; Molecule type: protein 

A;Residues: 17-39, ' ED 1 , 42-51, ' S 1 , 53-64 <FOL> 
A;Note: pro-form; 29-Leu was also found 

A;Note: activation at pH 2 is proposed to involve conformation change, cleavage 

after Phe-42, and cleavage after Leu-59 

C; Genetics : 

A; Gene: GDB: PGC 

A; Cross-references: GDB:119485; OMIM:169740 
A;Map position: 6p21 . 3-6p21 . 1 

A;Introns: 20/2; 70/3; 110/1; 149/3; 216/2; 256/2; 305/3; 338/3 
C; Superf amily : pepsin 

C;Keywords: aspartic proteinase; hydrolase; protein digestion; stomach; zymogen 
F; 1-1 6/ Domain : signal sequence ((status predicted <SIG> 
F; 17 -59/ Domain : propeptide #status experimental <PRO> 
F; 60-38 8/ Product : gastricsin #status experimental <MAT> 

Query Match 13.1%; Score 353; DB 2; Length 38 8; 

Best Local Similarity 29.1%; Pred. No. l.le-18; 

; Conservative 65; Mismatches 120; Indels 108; Gaps 21; 

PAERHAI)G-LALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLI GTPPQKLQI LVD 110 
||:: ||:: I I : I : I I : I : I I I I II : I I 

PAWKYRFGDLSVTYEP MAYMD AAYFGE I S I GT P PQN FLVL FD 91 

111 TGSSNFAV AGTPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDL 162 

I I I I I I I I I I I : I I I I : I : : : I I I I I I I I 

TGSSNLWVPSVYCQSQACTSHS RFNPSESSTYSTNGQTFSLQYGSGSLTGFFGYDT 147 



Matches 


12i 


Qy 


52 


Db 


50 


Qy 


111 


Db 


92 


Qy 


163 


Db 


148 


Qy 


218 


Db 


199 



: I : I I III II : : : | | : I I I I I : : : I 

LTV QSIQVPNQEFGLSEN — EPGTNFVYAQFDGIMGLAYPALSVDEAT — TAMQG 198 



||:: | : : I I : : [ I I : : I I I I I : : I : : I I : I I 

FSVYLSNQ QGSSGGAWFGGVDSSLYTGQIYWAPVTQELYWQI 252 



Qy 



277 EILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGF 336 



I : Mil: II : | | | | : | | : M : I I : I : : : I 

Db 253 GIEEFLIGGQASGW-CSE — GCQAIVDTGTSLLTVPQQYMSALLQA 295 

Qy 337 WTGSQLACWTNSETPWSYF PKISIYLRDENSSRSFRITILPQLYIQPMMG 386 

I I : I I : I I : : : : I I I 

Db 296 -TGAQ EDEYGQFLVNCNSIQNLPSLTFII NGVEFPLPPSSYI 336 

Qy 387 AGLNYECY-RFGISP STNA LVI GATVMEGFYVI FDRAQKRVGFAAS 431 

I : I I : I II : : I : : I : : I Mill: 

Db 337 — LSNNGYCTVGVEPTYLSSQNGQPLWILGDVFLRSYYSVYDLGNNRVGFATA 387 



RESULT 7 
JC7246 

pepsinogen C - common marmoset 

C; Species: Callithrix jacchus (common marmoset) 

C;Date: 09-Jun-2000 #sequence_revision 09-Jun-2000 #text_change 21-Jul-2000 
C;Accession: JC7246 
R;Kageyama, T. 

J. Biochem. 127, 761-770, 2000 

A; Title: New world monkey pepsinogens A and C, and prochymosins . Purification, 

characterization of enzymatic properties, cDNA cloning, and molecular evolution 

A; Reference number: JC7245 

A; Accession : JC724 6 

A; Molecule type: mRNA 

A;Residues: 1-388 <KAG> 

A; Cross-references : DDBJ: AB038385 

A; Experimental source: strain NW791 

C;Comment: This protein, a zymogen of pepsins, is the major proteolytic enzyme 
in vertebrate gastric juices. It plays roles in gastric digestion, and is a 
useful molecular marker for clarifying the evolution of mammalian orders and 
families . 

C; Super family : pepsin 

C; Keywords: gastric juice; zymogen 

Query Match 13.1%; Score 351.5; DB 2; Length 388; 

Best Local Similarity 30.1%; Pred. No. 1.4e-18; 

Matches 112; Conservative 56; Mismatches 115; Indels 89; Gaps 17; 

Qy 92 YYLEML I GT P PQKLQ I LVDT GS SN FAV AGTPHSYIDTYFDTERSSTYRSKGF 14 3 

I : I : M I I I I : I I II II I I MM I : M II I I 

Db 73 Y FGE I S I GT P PQN FLVL FDT GS SNLWVP S VYCQ S QACT S H S RFNPSASSTYSSNGQ 128 

Qy 144 DVTVKYTQGSWTGFVGEDLVTI PKGFNTS FLVNIATI FESENFFLPG IKWNGILG 198 

: : : I I I I I I I I : I : I I III II : : : I I : I 

Db 129 TFSLQYGSGSLTGFFGYDTLTV QSIQVPNQEFGLSEN — EPGTNFVYAQFDGIMG 181 

Qy 199 LAYATLAKPSSSLETFFDSLVTQANIPN-VFSMQMCGAGLPVAGSGTNGGSLVLGGIEPS 257 

Ml I : : : I — : : : I I I : | : : | | : : : || : : I 

Db 182 LAYPALSMGGAT — TAMQGMLQEGALTSPVFSFYLSNQ QGS S GGAVI FGGVD S S 233 

Qy 258 LYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFD 317 

II I I : : I : : I I : I I I : I II I : II : II M : II : II : II : 

Db 234 L YT GQ I YWAP VTQEL YWQ IGIEEFLI GGQAS GW- C S E — GCQAIVDTGTSLLTVPQQYMS 290 

Qy 318 AWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYF PKISIYLRDENS 367 

I : I I I I : I I : I I : : : 



D b 291 AFLEA TGAQ EDEYGQFLVNCDSIQNLPTLTFII 323 

Qy 368 SRSFRITILPQLYIQPMMGAGLNYECY-RFGISP STNAIA/T GATVMEGFYVT F 419 

: I I I I : I I : I I : : I : : I : I 

Db 324 -NGVEFPLPPSSYI LSNNGYCTVGVEPT YLS SQNSQPLWI LGDVFLRS YYSVF 375 



Qy 420 DRAQKRVGFAAS 431 

I Mill: 
Db 37 6 D L GNN RVG FATA 387 



RESULT 8 
B43356 

gastricsin (EC 3.4.23.3) precursor - guinea pig 

N; Alternate names: pepsin C 

C;Species: Cavia porcellus (guinea pig) 

C;Date: 03-Feb-1994 #sequence_revision 03-Feb~1994 #text_change 22-Jun-1999 
C;Accession: B43356 

R;Kageyama, T.; Ichinose, M. ; Tsukada, S.; Miki, K. ; Kurokawa, K.; Koiwai, O.; 
Tanji, M. ; Yakabe, E.; Athauda, S.B.; Takahashi, K. 
J. feiol- Chem. 267, 16450-16459, 1992 

A;Title: Gastric procathepsin E and progastricsin from guinea pig. Purification, 
molecular cloning of cDNAs, and characterization of enzymatic properties, with 
special reference to procathepsin E. 

A; Reference number: A43356; MUID : 92355614 ; PMID: 1644829 
A;Accession: B43356 
A; Status : preliminary 
A;Molecule type: mRNA 
A; Residues: 1-394 <KAG> 

A; Cross-references: GB:M88652; NID:gl91296; PIDN : AAA37053 . 1; PID:gl91297 
A;Note: sequence extracted from NCBI backbone (NCBIN: 110805, NCBIP: 110806) 
C; Superf amily : pepsin 

C;Keywords: aspartic proteinase; gastric juice; hydrolase; protein digestion; 
stomach 

Query Match 12.1%; Score 324.5; DB 2; Length 394; 

Best Local Similarity 29.0%; Pred. No. 1.5e-16; 



Matches 


107; Conservative 63; Mismatches 116; Indels 83; Gaps 


18; 


Qy 


92 


YYLEMLIGTPPQKLQILVDTGSSNF AVAGTPHSYIDTYFDTERSSTYRSKGF 

I : : : : 1 1 1 1 1 1 : 1 1 1 1 1 1 1 : : 1 1 1 1 1= INI: 
YFGQISLGTPPQSFQVLFDTGSSNLWVPSVYCSSLACTTH TRFNPRDSSTYVATDQ 


143 


Db 


79 


134 


Qy 


144 


DVTVKYTQGSWTGFVGEDLVTI PK-GFNTSFLVNIATIFESENFFLPG IK 

: : : 1 1 1 1 1 1 1 : 1 1 II 1 1 hi M : 

SFSLEYGTGSLTGVFGYDTMTIQDIQVPKQEFGLS ETE PGSDFVYAE 


192 


Db 


135 


181 


Qy 


193 


WNGILGLAYATLAKPSSSLETFFDSLVTQANI-PNVFSMQMCGAGLPVAGS — GTNGGSL 

:: I I I 1 1 1 I : : : : 1 1 : : : : : 1 1 : : II 1 : : 1 1 

FDGILGLGYPGLSEGGAT— TAMQGLLREGALSQSLFSVYL GSQQGSDEGQL 


249 


Db 


182 


231 


Qy 


250 


VLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLL 

: I 1 1 : : 1 1 1 1 II : : 1 1 : : 1 1 : 1 1 1 1 1 : 1 : I M : I 1 : 1 1 

ILGGVDESLYTGDIYWTPVTQELYWQIGIEGFLIDGSASGWCSR GCQGIVDTGTSLL 


309 


Db 


232 


288 


Qy 


310 


RLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENSSR 


369 



Db 289 TVPSDYLSTLVQAIGAEE — NEYGEYF VSCSSIQDLPTLTFVISGV 



Qy 370 SFRITILPQLYIQP MMGAGLNYECYRFGISPSTN — ALVIGATVMEGFYVI FDRA 422 

: I I I I : I I : I I "I : : I : : I I 

D b 333 — EFPLSPSAYILSGENYCMVGLESTY VSPGGGEPVWILGDVFLRSYYSVYDLA 384 

Qy 423 QKRVGFAAS 431 

I I I I I : 

Db 385 NNRVGFATA 393 



RESULT 9 
JC7575 

pepsinogen A - bullfrog 

C; Species: Rana catesbeiana (bullfrog) 

C;Date: 30-Jun-2001 #sequence_revision 30-Jun-2001 #text_change 03-Aug-2001 
C;Accession: JC7575 

R;Ikuzawa, M. ; Inokuchi, T . ; Kobayashi, K. ; Yasumasu, S. 
J. Biochem. 129, 147-153, 2001 

A; Title: Amphibian pepsinogens: Purification and characterization of Xenopus 

pepsinogens, and molecular cloning of Xenopus and bullfrog pepsinogens. 

A; Reference number: JC7573; MUID : 21064922 ; PMID: 11134969 

A; Contents: Stomach 

A;Accession: JC7575 

A;Molecule type: mRNA 

A; Residues: 1-385 <IKU> 

A; Cross-references : DDB J: ABO 4 5 37 6 

C; Comment: This protein is a zymogen for gastric aspartic proteinase, with 

pepsin-like activity. 

C; Genetics : 

A; Gene: PgA 

C; Superf amily : pepsin 

C; Keywords: stomach; zymogen 

Query Match 11.9%; Score 320; DB 2; Length 385; 

Best Local Similarity 27.8%; Pred. No. 3.2e-16; 

Matches 111; Conservative 67; Mismatches 147; Indels 74; Gaps 15; 

^TPAERHADGLALALEPAIASPAGAANFLAMW 109 

: : I | I : I I : I : I I I :: I I I I I I : : 

;dylkkhhynpatkyfpslaqasg EPLQNYMDIEYFGTISIGTPPQSFTVIF 90 



QY 


50 


Db 


39 


Qy 


110 


Db 


91 


Qy 


163 


Db 


148 


Qy 


222 


Db 


203 


Qy 


281 


Db 


256 



| | | | | I I I : | : : : | | | : : : I : : : I I I : I I : I I 



II : : I I I I I I : : I I II II- I 



| | : : | | : : : I : I : : I I : : I I I : : : I : I I : I I : 



||||: : I I I I : II : I I I I : : I : ' : 



Qy 341 QLACWTNSETPWSYFP KISIYLRDENSS — RSFRITILPQLYIQPMMGAGLN 390 

: I I I I I I : I I I : I I 

Db 306 -INCNNISNMPTWFTINGVQYPLPASAYVRQSQQSCTSGFQAMNLP 351 

Qy 391 YECYRFGISPSTNALVIGATVMEGFYVIFDRAQKRVGFA 429 

: I : : : I : : I I : I I I I I I 

Db 352 T S S GDLWI LGDVFI REYYWFDRANNYVAMA 382 



RESULT 10 
REMSK 

renin (EC 3.4.23.15) precursor, renal - mouse 

N;Alternate names: angiotensin- forming enzyme; angiotensinogenase; renin 1 
C; Species: Mus musculus (house mouse) 

C;Date: 30-Jun-1987 #sequence_revision 30-Jun-1987 #text_change 18-Jun-1999 
C;Accession: A00989; S07636; A22766; A22058; 157576; A05137; JH0083 
R;Holm, I.; Olio, R. ; Panthier, J.J.; Rougeon, F. 
EMBO J. 3, 557-562, 1984 

A;Title: Evolution of aspartyl proteases by gene duplication: the mouse renin 

gene is organized in two homologous clusters of four exons . 

A; Reference number: A00989; MUID: 84182525 ; PMID: 6370686 

A;Accession: A00989 

A; Molecule type: DNA 

A; Residues: 1-402 <HOL> 

A; Cross-references : EMBL:X008 50 

R;Kim, W.S.; Murakami, K.; Nakayama, K. 

Nucleic Acids Res. 17, 9480, 1989 

A; Title: Nucleotide sequence of a cDNA coding for mouse Renl preprorenin. 
A;Reference number: S07636; MUID : 90067 953 ; PMID:2685761 
A;Accession: S07636 
A;Molecule type: mRNA 
A; Residues: 1-402 <KIM> 

A; Cross-references: EMBL:X16642; NID:g53930; PIDN : CAA34 636 . 1 ; PID:g53931 
R;Mullins, J.J. ; Burt, D.W.; Windass, J.D.; McTurk, P.; George, H. ; Brammar, 
W.J. 

EMBO J. 1, 1461-1466, 1982 

A; Title: Molecular cloning of two distinct renin genes from the DBA/2 mouse. 

A; Reference number: A90968; MUID : 84207 899 ; PMID: 6327270 

A;Accession: A22766 

A;Molecule type: mRNA 

A; Residues: 269-314 , 1 D 1 , 316 <MUL> 

R; Panthier, J.J.; Dreyfus, M. ; Roux, D.T.L.; Rougeon, F. 
Proc. Natl. Acad. Sci. U.S.A. 81, 5489-5493, 1984 

A;Title: Mouse kidney and submaxillary gland renin genes differ in their 5' 
putative regulatory sequences. 

A; Reference number: A22058; MUID : 84298161 ; PMID: 6089205 
A;Accession: A22058 
A;Molecule type: DNA 
A; Residues: 1-30 <PAN> 

R; Field, L.J.; Philbrick, W.M. ; Howies, P.N.; Dickinson, D.P.; McGowan, R.A. ; 
Gross, K.W. 

Mol. Cell. Biol. 4, 2321-2331, 1984 

A;Title: Expression of tissue-specific Ren-1 and Ren-2 genes of mice: 
Comparative analysis of 5 '-proximal flanking regions. 
A; Reference number: 157576; MUID : 85085936; PMID: 6392850 
A;Accession: 157576 



A; Status: preliminary; translated from GB/EMBL/DDB J 
A;Molecule type: DNA 
A; Residues: 1-31 <RES> 

A; Cross-references: GB:K02800; NID:g200689; PIDN : AAA40044 . 1; PID:g200690 
C; Comment: The only known function of renal renin is to release angiotensin I 
from angiotensinogen in the plasma, initiating a cascade of reactions that 
produces an elevation of blood pressure and increased sodium retention by the 
kidney . 

C; Comment: Renal renin is synthesized by the juxtaglomerular cells of the kidney 
in response to decreased blood pressure and sodium concentration. 
C; Genetics : 
A; Gene: Ren-1 

A;Introns: 31/2; 81/3; 123/1; 162/3; 228/2; 268/2; 316/3; 349/3 
C; Superf amily : pepsin 

C;Keywords: aspartic proteinase; blood pressure control; glycoprotein; 
hydrolase; kidney; plasma 

F; 1-21 /Domain : signal sequence #status predicted <SIG> 
F;22-64/Domain: propeptide ffstatus predicted <PRO> 
F;65-402/Product: renin #status predicted <MAT> 

F; 69, 139, 320/Binding site: carbohydrate (Asn) (covalent) #status predicted 
F; 102, 287/Active site: Asp ((status predicted 

Query Match 11-9%; Score 320; DB 1; Length 402; 

Best Local Similarity 28.6%; Pred. No. 3.4e-16; 

Matches 126; Conservative 66; Mismatches 181; Indels 68; Gaps 21; 

Qy 10 L P L LAQWLLRAAP E LAP AP FT L P LRVAAATN RWAP T P G- P GT P AERHAD GLALAL E 65 

: II I I I : I I : I I I I : I III hi 

Db 6 MPLWALLLL WSPCTFSLPTRTATFERIPLKKMPSVREILEERGVDMTRLSAEWGV 60 

Qy 66 PA LASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAV 118 

I : III I : I II I I I : I I I I I I : : : I I I I : I I 

Db 61 FTKRP S LTNLT S P WLTN YL NTQ YYGEIGIGTPPQTFKVIFDTGSANLWV 110 

Qy 119 AGTPHSY IDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTS 172 

I I I : : : : I I : I I I I : I I I I : : I I I : I : 

Db 111 PSTKCSRLYLACGIHSLYESSDSSSYMENGSDFTIHYGSGRVKGFLSQDSVTV-GGITVT 169 

Qy 173 FLWIATIFESENFFLPGIKWNGILGLAYATL7UCPSSSLETFFDSLVTQANI-PNVFSMQ 231 

I || | : : | : M : : I : : I I : : : I : III: 

Db 170 QTFGEVTELPLIPFML — AKFDGVLGMGFP — AQAVGGVTPVFDH1LSQGVLKEEVFSVY ,225 

Qy 232 MCGAGLPVAGSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLD 291 

II I I : I I I I : I I : I : I I : : I I : : : I I I 

Db 226 Y NRGSHLLGGEWLGGSDPQHYQGNFHYVSISKTDSWQITMKGVSVG — SSTLL 277 

Qy 292 CREYNADKAIVDSGTTLLRLPQKVFDAWEAV-ARASLIPEFSDGFWTGSQLACWTNSET 350 

II I : | | : | : : : | : : : | : | : I I : : I I : 
Db 278 CEEGCA — WVDTGSSFISAPTSSLKLIMQALGAKEKRIEEY WNC SQV 324 

Qy 351 PWSYFPKISIYLRDENSSRSFRITILPQLYIQPMMGAGL-NYECYRFGISPSTNAL-VIG 408 

| I I I I I : : : : : I I : I I I : I : I 

Db 325 P — TLPDISFDL GGRAYTLSSTDYVLQYPNRRDKLCTLALHAMDI PPPTGPVWVLG 378 



Qy 409 ATVMEGFYVI FDRAQKRVGFA 429 

II: II IN hill 
Db 37 9 ATFIRKFYTEFDRHNNRIGFA 399 



RESULT 11 
S66516 

oryzasin (EC 3.4.23.-) precursor - rice 
N;Alternate names: aspartic proteinase 1 
C; Species: Oryza sativa (rice) 

C;Date: 28-Oct-1996 #sequence_revision 13-Mar-1997 #text_change 20-Jun-2000 
C /Accession: S66516; S 665 17 

R;Asakura, T.; Watanabe, H. ; Abe, K. ; Arai, S. 
Eur. J. Biochem. 232, 77-83, 1995 

A;Title: Rice aspartic proteinase, oryzasin, expressed during seed ripening and 
germination, has a gene organization distinct from those of animal and microbial 
aspartic proteinases. 

A/Reference number: S66516; MUID : 9604 8031 ; PMID:7556174 
A; Acces si on : S 6 65 16 
A; Molecule type: DNA 
A; Residues: 1-509 <ASA> 

A; Cross-references: EMBL:D32165; NID:g511665; PIDN : BAA06876 . 1 ; PID:gl030715 
A; Accession: S 6 65 17 
A;Molecule type: mRNA 
A; Residues: 1-509 <ASZ> 

A;Cross-references: EMBL:D32144; NID : gl255684 ; PIDN : BAA06875 . 1 ; PID:gl711289 
C; Comment: The pair of saposin repeat homology domains tagged SAPl and SAP 2 
represent a cyclical permutation of a single saposin repeat. 
C * Genetics : 

A;Introns: 119/3; 140/1; 171/3; 209/2; 265/3; 279/1; 300/3; 338/3; 360/2; 412/3; 
452/3; 482/2 

C; Super family : oryzasin; saposin repeat homology 

C; Keywords: aspartic proteinase; hydrolase 

F; 1-20 /Domain : signal sequence #status predicted <SIG> 

F;21-68/Domain: propeptide tstatus predicted <PRO> 

F; 68-509/Product : aspartic proteinase 1 ((status predicted <MAT> 

F; 316-361/Domain: saposin repeat homology #status atypical <SAP1> 

F; 370-420/Domain: saposin repeat homology #status atypical <SAP2> 

F; 103, 290/Active site: Asp #status predicted 

Query Match 11.7%; Score 313.5; DB 2; Length 509; 

Best Local Similarity 23.0%; Pred. No. 1.5e-15; 

Matches 127; Conservative 75; Mismatches 179; Indels 171; Gaps 19; 

ALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGLAL 62 

: : I I I : I I I I I : I II : I I I I I I I I 

SVALVLLAAVLLQALLPASAEEGLVRIALKKRPIDENSRVAARLSG EEGARRLGL 59 

ALEPAIAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSN 115 

: I I : : | : : : I : I : : I I I I I I :: I I I II I 



Qy 


3 


Db 


5 


Qy 


63 


Db 


60 


Qy 


116 


Db 


116 


Qy 


174 


Db 


166 



I : : I II : : : I I II : I : : I II II MM: 



II: I :: I I I I I : 



Qy 225 PNVFSMQMCG-AGLPVAGSGTN GGSLVLGGIEPSLYKGDIWYTPIKEEWYYQI 276 

I : I I II I I I : I I I : : I I I I I : | I : : : I : I 

Db 207 PVWYKMVEQGLVSEPVFSFWFNRHSDEGEGGEIVFGGMDPSHYKGNHTYVPVSQKGYWQF 266 

Qy 277 EILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFDAWEAVARASLIPE 331 

I : : I I I : : I : I I I I I I : I I I : : I : : : : 

Db 2 67 EMGDVLIGGKTTGF-CA — SGCSAIADSGTSLLAGPTAI ITEINEKIGATGWSQECKTV 323 

Qy 332 FSDGF 336 

I : I 

Db 324 VSQYGQQILDLLLAETQPSKICSQVGLCTFDGKHGVSAGIKSWDDEAGESNGLQSGPMC 383 

Qy 337 WTGSQLACWTNSETPWSY : FPKISIYLRD 364 

I : II I : : I I : I I : 

Db 384 NACEMAVVWMQNQLAQNKTQDLI LNYINQLCDKLPS PMGES SVDCGSLASMPEI S FTI GA 443 

Qy 365 ENSSRSFRITILPQLYIQPMMGAGLNYECY RFGI S PSTNAL-VT GATVMEGFYVI F 419 

: : : | : | | : I I : I II I : : I I : : : I 

Db 444 K KFALKPEEYIL-KVGEG7WVQCISGFTAMDIPPPRGPLWILGDVFMGAYHTVF 496 

Qy 420 DRAQKRVGFAAS 431 

I : I I I I I I 
Db 497 DYGKMRVGFAKS 508 



RESULT 12 
A24608 

gastricsin (EC 3.4.23.3) precursor - rat 
N;Alternate names: pepsinogen C 
N;Contains: pepsin A (EC 3.4.23.1) precursor 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 30-Jun-1988 #sequence_revision 05-Aug-1994 #text_change 18-Jun-1999 
C;Accession: A33510; A24608; C22434; A05145; A61298 

R;Ishihara, T . ; Ichihara, Y. ; Hayano, T . ; Katsura, I.; Sogawa, K. ; Fujii- 

Kuriyama, Y.; Takahashi, K. 

J. Biol. Chem. 264, 10193-10199, 1989 

A;Title: Primary structure and transcriptional regulation of rat pepsinogen C 
gene . 

A;Reference number: A33510; MUID : 89255508 ; PMID:2722863 

A; Accession: A33510 

A; Molecule type: DNA 

A; Residues: 1-392 <ISH> 

A; Cross-ref erences : GB:M25985 

R; Ichihara, Y. ; Sogawa, K.; Morohashi, K. ; Fujii-Kuriyama, Y. ; Takahashi, K. 
Eur. J. Biochem. 161, 7-12, 1986 

A; Title: Nucleotide sequence of a nearly full-length cDNA coding for pepsinogen 
of rat gastric mucosa. 

A;Reference number: A24608; MUID: 87054020; PMID:3780741 
A; Accession: A24608 
A; Molecule type: mRNA 
A; Residues: 1-392 <ICH> 

A;Cross-ref erences: GB:X04644; NID:g56880; PIDN : CAA28305 . 1 ; PID:g56881 
R;Ichihara, Y. ; Sogawa, K. ; Takahashi, K. 
J. Biochem. 98, 483-492, 1985 

A;Title: Isolation of human, swine, and rat prepepsinogens and calf 
preprochymosin, and determination of the primary structures of their NH2- 
terminal signal sequences. 



A; Reference number: A22434; MUID: 86059312; PMID:2415509 
A; Accession: C22434 
A;Molecule type : protein 

A;Residues: 1-19, ' X 21-23, 1 X \ 25-29 <IC2> 

R;Arai, K.M.; Muto, N.; Tani, S.; Akahane, K. 

Biochim. Biophys . Acta 788, 256-261, 1984 

A;Title: The N-terminal sequence of rat pepsinogen. 

A; Reference number: A05145; MUID: 84257697; PMID: 6743670 

A;Accession: A05145 

A;Molecule type : protein 

A;Residues: 17-30 , * Q *, 32-102 , 'A' , 1 04-108 , 1 L 1 , 110-112 <ARA> 
A; Experimental source: Wistar strain 
R;Ichihara, Y. ; Sogawa, K. ; Takahashi, K. 
J. Biochem. 92, 603-606, 1982 

A;Title: Rat gastric prepepsinogen : in vitro synthesis and partial amino- 
terminal signal sequence. 

A; Reference number: A61298; MUID : 83030750 ; PMID: 6182139 
A; Accession: A61298 
A;Molecule type : protein 

A; Residues: 1, 'XX', 4-6, , X , ,8-9, 'X' ,11, 'X', 13-14, 'XXX', 18- 
19, 'X' ,21, 'X 1 ,23, 'XX' ,2 6, 'X' <IC3> 

C; Comment: This enzyme has more restricted specificity than pepsin A. It is the 
major form of pepsinogen in rat gastric mucosa. 
C;Genetics : 

A;Introns: 20/2; 73/3; 113/1; 152/3; 219/2; 259/2; 309/3; 342/3 
A;Note: there are at least two very similar genes for gastricsin in rat 
C; Super f amily : pepsin 

C; Keywords: aspartic proteinase; gastric juice; hydrolase; protein digestion; 
stomach 

F; 1-16/Domain : signal sequence #status experimental <SIG> 
F;17-392/Product: pepsinogen #status experimental <MAT> 
F;17-62/Domain: activation peptide #status experimental <ACT> 
F; 94, 280/Active site: Asp #status predicted 

F; 107-112, 270-275, 314-347/Disulf ide bonds: ((status predicted 

Query Match 11.6%; Score 313; DB 1; Length 392; 

Best Local Similarity 29.5%; Pred. No. l.le-15; 

Matches 105; Conservative 56; Mismatches 139; Indels 56; Gaps 16; 

92 YYLEMLIGTPPQKLQILVDTGSSNFAV AGTPHSYIDTYFDTERSSTYRSKGF 143 

I : I : I I I II I : I I I I I I I I III: I : : I I I I :: I 

76 YFGEI S I GTPPQNFLVLFDTGSSNLWVS SVYCQSEACTTHA RFNPSKSSTYYTEGQ 131 

144 DVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPG IKWNGILG 198 

: : : I I I I I I I I : I : I I III II : : : | | : I 

132 TFSLQYGTGSLTGFFGYDTLTV QSIQVPNQEFGLSEN — EPGTNFVYAQFDGIMG 184 

199 LAYATLAKP S S S LET FFDS LVTQANI PNVFSMQMCGAGLPVAGS — GTNGGSLVLGGIEP 256 

IN I : I I : : : I : I I II I : I I I : I I I : : 

185 LAY P GL S — S GGATT ALQGMLGE GALSQPLFGVYL GSQQGSNGGQIVFGGVDK 235 

257 SLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVF 316 

: I I I : I : I : : I I : I I I III: I : j | | : | I : I | : 1 : 

236 NLYTGEITWVPVTQELYWQITIDDFLIGDQASGW-CSSQGC-QGIVDTGTSLLVMPAQYL 293 

317 DAWEAV7VRASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENSSRSFRITIL 376 
: : : : I : : I : : I : I I : I I : : 



Qy 


92 


Db 


76 


Qy 


144 


Db 


132 


Qy 


199 


Db 


185 


Qy 


257 


Db 


236 


Qy 


317 



Db 2 94 SELLQTIGAQE — GEYGEYF VSCDSVSS LPTLSFVL NGVQFPLS 335 



Qy 377 PQLY-IQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRAQKRVGFAAS 431 

I I I I : I :: I : : I I II : I I I I 

Db 336 PSSYIIQEDNFCMVGLESISLTSESGQPLWILGDVFLRSYYAIFDMGNNKVGLATS 391 



RESULT 13 
A41443 

pepsin (EC 3.4.23.-) precursor, embryonic - chicken 
C; Species: Gallus gallus (chicken) 

C;Date: 05-Jun-1992 #sequence_revision 05-Jun-1992 #text_change 21-Jul-2000 
C; Accession: A41443 

R;Hayashi, K.; Agata, K. ; Mochii, M. ; Yasugi, S.; Eguchi, G. ; Mizuno, T. 
J. Biochem. 103, 290-296, 1988 

A; Title: Molecular cloning and the nucleotide sequence of cDNA for embryonic 

chicken pepsinogen: phylogenetic relationship with prochymosin. 

A; Reference number: A41443; MUID : 88227903 ; PMID: 3131317 

A; Accession: A41443 

A; Status: preliminary 

A; Molecule type: mRNA 

A; Residues: 1-383 <HAY> 

A; Cross-references: GB:D00215; NID: g27608 10 ; PIDN: BAA00153 . 1 ; PID:g222853 
C; Superfamily : pepsin 

C;Keywords: aspartic proteinase; hydrolase; protein digestion 

Query Match 11.5%; Score 310; DB 2; Length 383; 

Best Local Similarity 26.8%; Pred. No. 1.8e-15; 

Matches 106; Conservative 63; Mismatches 136; Indels 90; Gaps 15; 

Qy 56 HA — DGLALALEPAIASPAGAANFLAMVDNL 113 

1111:111 III 11:111111 : : I I I I 

Db 55 HAFPDVLTWTEPLL NTLDM EYYGT ISIGT P PQD FT WFDTGS 97 

Qy 114 SNFAVAG TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGF 169 

II I : I I : : I | | | : | | : : : : | I I I I I I I : 

Db 98 SNLWVPSVSCTSPACQSHQMFNPSQSSTYKSTGQNLSIHYGTGDMEGTVGCDTVTVASLM 157 

Qy 170 NTSFLWIATIFESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANI-PNVF 228 

: I : I : : I I II: : I : : I I I I I I : I I : : I I : : I : : : hi 

Db 158 DTNQLFGLST-SEPGQFFV-YVKFDGILGLGYPSLA--ADGITPVFDNMVNESLLEQNLF 213 

Qy 229 SMQMCGAGLPVAGSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSL 2 88 

I : : : I : I I I I : I : I I : I : : I : I I : : : I : 

Db 214 SVYLSREPM GSMWFGGIDESYFTGSINWIPVSYQGYWQISMDSIIVNKQEI 265 

Qy 289 NLDCREYNMKAIVI)SGTTLLRLPQKVFDAWEAVARASLIPEFSDG™TGSQLACWTNS 348 

: : I I : I : I I : I : I : : I I I 

Db 266 ACS S GC QAI I DT GT S L VAG PAS D I N D I Q S AVG ANQ 300 

Qy 349 ETPWSYFPKISIYLRDENSSRSFRITILPQLYIQPMMGAGLNY ECY 394 

II I I : I : : : I I : I I 

Db 301 NTYGEY SVNCSHILAMPDWF — VIG-GIQYPVPALAYTEQNGQGTCM 345 

Qy 395 RFGISPST N ALVI GAT VME G F YVI FD RAQ KRVG FA 429 

: I : : : I : : I I I I I I I I I I 

Db 346 SSFQNSSADLWILGDVFIRVYYSIFDRANNRVGLA 38 0 



RESULT 14 
KHHUD 

cathepsin D (EC 3.4.23.5) precursor [validated] - human 
N;Alternate names : preprocathepsin D 
C; Species : Homo sapiens (man) 

C;Date: 28-Dec-1987 #sequence_revision 28-Dec-1987 #text_change 15-Sep-2000 

C;Accession: A25771; S30749; PC2066; 159236; 157716 

R;Faust, P.L.; Kornfeld, S.; Chirgwin, J.M. 

Proc. Natl. Acad. Sci. U.S.A. 82, 4910-4914, 1985 

A; Title: Cloning and sequence analysis of cDNA for human cathepsin D. 
A; Reference number: A25771; MUID: 85270436; PMID: 3927292 
A;Accession: A25771 
A;Molecule type: mRNA 
A; Residues: 1-412 <FAU> 

A;Cross-references : EMBL:M11233; NID:gl81179; PIDN : AAB59529 . 1 ; PID:gl81180 

R;Westley, B.R.; May, F.E.B. 

Nucleic Acids Res. 15, 3773-3786, 1987 

A;Title: Oestrogen regulates cathepsin D mRNA levels in oestrogen responsive 
human breast cancer cells. 

A; Reference number: S30749; MUID: 87231068; PMID:3588310 
A;Accession: S30749 
A;Molecule type: mRNA 
A; Residues: 1-412 <WES> 

A;Cross-references : EMBL:X05344; NID:g29677; PIDN: CAA28 955 . 1; PID:g29678 
R;May, F.E.B. ; Smith, D.J.; Westley, B.R. 
Gene 134, 277-282, 1993 

A; Title: The human cathepsin D-encoding gene is transcribed from an estrogen- 
regulated and a constitutive start point. 
A; Reference number: PC2066; MUID: 94085791; PMID: 8262386 
A; Accession: PC2066 
A;Molecule type: DNA 
A; Residues: 1-23 <MAY> 

A;Cross-references: GB:L12980; NID:g291930; PIDN : AAA16314 . 1; PID:g455429 
A; Experimental source: MCF-7 cell 
R;Cavailles, V.; Augereau, P.; Rochefort, H. 
Proc. Natl. Acad. Sci. U.S.A. 90, 203-207, 1993 

A; Title: Cathepsin D gene is controlled by a mixed promoter, and estrogens 
stimulate only TATA-dependent transcription in breast cancer cells. 
A; Reference number: 159236; MUID: 93126342 ; PMID: 8419924 
A;Accession: 159236 

A; Status: translation not shown; translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A; Residues: 1-22 <CAV1> 

A;Cross-references: GB:S52557; NID:g263124; PIDN : AAD138 68 . 1; PID:g4261568 
R;Augereau, P.; Miralles, F. ; Cavailles, V.; Gaudelet, C; Parker, M. ; 
Rochefort, H. 

Mol. Endocrinol. 8, 693-703, 1994 

A;Title: Characterization of the proximal estrogen-responsive element of human 
cathepsin D gene. 

A;Reference number: 157716; MUID: 95021301; PMID:7935485 
A;Accession: 157716 

A; Status: translation not shown; translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A; Residues: 1-22 <CAV2> 

A;Cross-references: GB:S74689; NID:g786350; PIDN: AAD14156. 1; PID:g4261856 



R;Baldwin, E.T.; Bhat, T.N.; Gulnik, S.; Erickson, J.W. 
submitted to the Brookhaven Protein Data Bank, April 1993 
A; Reference number: A51839; PDB: 1LYA 

A; Contents: annotation; X-ray crystallography, 2.5 angstroms, residues 65- 
161;170-241 

R;Baldwin, E.T.; Bhat, T.N.; Gulnik, S.; Erickson, J.W. 
submitted to the Brookhaven Protein Data Bank, April 1993 
A; Reference number: A51840; PDB : 1LYB 

A; Contents: annotation; X-ray crystallography, 2.5 angstroms, with inhibitor 
residues 65-161; 170-241 

R;Baldwin, E.T.; Bhat, T.N.; Gulnik, S.; Hosur, M.V. ; Sowder II, R.C.; Cachau, 
R.E.; Collins, J.; Silva, A.M.; Erickson, J.W. 
Proc. Natl. Acad. Sci. U.S.A. 90, 6796-6800, 1993 

A; Title: Crystal structures of native and inhibited forms of human cathepsin D: 

implications for lysosomal targeting and drug design. 

A; Reference number: A48229; MUID : 93342076 ; PMID: 8393577 

A; Contents: annotation; X-ray crystallography, 2.5 angstroms 

C; Comment: Cathepsin D is a ubiquitous lysosomal proteinase. 

C; Comment: In addition to the propeptide, residues 163-168 and 411-412 are 

proteolytically removed. Residues 169 and 170 are also partially removed. 

C; Comment: The carbohydrate bound to 134-Asn contains a mannose-6-phosphate that 

is bound near 267-Lys and the phosphotransferase recognition region. 

C; Genetics : 

A; Gene: GDB : CTSD 

A; Cross-references: GDB: 12 0512; OMIM: 116840 
A; Map position: llpl5 , 5-llpl5 . 5 
C; Function : 

A; Description : limited specificity endopeptidase 
A; Pathway: intracellular protein degradation 
C; Super family : pepsin 

C; Keywords: aspartic proteinase; glycoprotein; hydrolase; lysosome; protein 
degradation 

F; 1-20/Domain : signal sequence #status predicted <SIG> 

F; 21-64/Domain : propeptide #status predicted <PRO> 

F; 65-162, 169-410/Product: cathepsin D ffstatus experimental <MAT> 

F; 267, 329-356/Region: phosphotransferase recognition 

F; 91-160, 110-117,286-290, 329-366/Disulfide bonds: ftstatus experimental 
F; 97 , 295/Active site: Asp #status experimental 

F; 134 , 263/Binding site: carbohydrate (Asn) (covalent) #status experimental 

Query Match 11.5%; Score 308.5; DB 1; Length 412; 

Best Local Similarity 27.1%; Pred. No. 2.5e-15; 

Matches 121; Conservative 75; Mismatches 180; Indels 71; Gaps 22; 

Qy 9 LLP LLAQWL L RAAP E LAP AP FT L P LRVAAATN RWAP T P G PGTPAERHADGLAL 62 

I I I I I II I I I : I I : I : : I I : : : : 

Db 6 LLPLAL — CLLAAP — ASALVRIPLHKFTSIRRTMSEVGGSVEDLIAKGPVSKYSQAVPA 61 

Qy 63 ALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTP 122 

I : I I : : I I I : I I I I I I : : I I I I I I I 

Db 62 VTEGPI — PEVLKNYM DAQYYGEIGIGTPPQCFTWFDTGSSNLWVPSIH 109 

Qy 123 HSYIDT YFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIP — KGFNTSFL 174 

: I : : : : : | | | | | : I | | : | : : : | | : : | : I I 

Db 110 CKLLDIACWIHHKYNSDKSSTYVKNGTSFDIHYGSGSLSGYLSQDTVSVPCQSAS SASAL 169 



Qy 



175 



— VNIATIFESENFFLPGI 



KWNGILGLAYATLAKPSSSLETFFDSLVTQANI-PN 226 



I : I III I : : I I I I : I I : : : : : I I : I : I : I 

Db 170 GGVKVERQVFGEATKQPGITFIAAKFDGILGMAYPRIS — WNVLPVFDNLMQQKLVDQN 227 

Qy 227 VFSMQMCGAGLPVAGS GTNGGS LVLGGI EPSLYKGDIWYT P I KEEWYYQI EILKLEI GGQ 286 

: I I I I I I : I I I : I I I : I : : I : I : : : : I : 

Db 228 IFSFY LSRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQVEV-AS 281 

Qy 287 SLNLDCREYNADKAIVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWT 346 

111:1 : I I I I : I I : I : I III : I : : I 

Db 282 GLTL-CKE — GCEAIVDTGTSLMVGP VDEVRELQKAIGAVPLIQGEY MI PC — 32 9 

Qy 347 NSETPWSYFPKISIYLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRF GISPSTN 403 

I I I : : I : : : : : I : I : | | | I I : 

Db 330 EKVSTLPAITLKL GGKGYKLS — PEDYTLKVSQAGKTLCLSGFMGMDIPPPSG 380 

Qy 4 04 AL-VT GATVMEGFYVI FDRAQKRVGFA 429 

I : : I : : I : I I I I I I I I 

Db 381 PLWILGDVFIGRYYTVFDRDNNRVGFA 407 



RESULT 15 
KHMSD 

cathepsin D (EC 3.4.23.5) precursor - mouse 
C; Species: Mus musculus (house mouse) 

C;Date: 31-Dec-1991 (fsequence__revision 31-Dec-1991 #text_change 18-Jun-1999 
C;Accession: 148278; S14704; S12587 

R;Hetman, M. ; Perschl, A.; Saftig, P.; Von Figura, K. ; Peters, C. 
DNA Cell Biol. 13, 419-427, 1994 

A; Title: Mouse cathepsin D gene: molecular organization, characterization of the 
promoter, and chromosomal localization. 

A; Reference number: 148278; MUID : 94280622 ; PMID: 8011168 
A; Accession : 14827 8 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A; Residues: 1-410 <RES> 

A;Cross-references: EMBL:X68378; NID:g50302; PIDN : CAA4 84 53 . 1 ; PID:g817945 
R;Diedrich, J.F.; Staskus, K.A.; Retzel, E.F.; Haase, A.T, 
Nucleic Acids Res. 18, 7184, 1990 

A;Title: Nucleotide sequence of a cDNA encoding mouse cathepsin D. 
A; Reference number: S14704; MUID: 91088345; PMID:2263503 
A;Accession: S14704 
A;Molecule type: mRNA 
A; Residues: 1-410 <DIE> 

A;Cross-references: EMBL:X53337; NID:g50300; PIDN : CAA37423 . 1 ; PID:g50301 
R;Grusby, M.J.; Mitchell, S.C.; Glimcher, L.H. 
Nucleic Acids Res. 18, 4008, 1990 

A; Title: Molecular cloning of mouse cathepsin D. 

A; Reference number: S12587; MUID : 90326544 ; PMID:2374732 

A; Accession : S12587 

A; Molecule type: mRNA 

A; Residues: 1-410 <GRU> 

A;Cross-references: EMBL:X52886; NID:g50298; PIDN: CAA37 067 . 1; PID:g50299 
C; Genetics : 

A;Introns: 23/2; 76/3; 118/1; 157/3; 233/2; 274/2; 322/3; 355/3 
C; Function : 

A; Description : limited specificity endopeptidase 
A; Pathway: intracellular protein degradation 



C; Superfamily : pepsin 

C; Keywords: aspartic proteinase; glycoprotein; hydrolase; lysosome; protein 
degradation 

F; 1-20/Domain : signal sequence #status predicted <SIG> 
F; 2 1-64 /Domain: propeptide #status predicted <PRO> 

F; 65- 4 10/ Product : cathepsin D, single-chain form #status predicted <MAT> 
F; 91-160, 110-117, 284-288, 327-364/Disulf ide bonds: #status predicted 
F; 97, 293/Active site: Asp #status predicted 

F; 134, 261/Binding site: carbohydrate (Asn) (covalent) #status predicted 



Query Match 11.4%; Score 306.5; DB 1; Length 410; 

Best Local Similarity 27.5%; Pred. No. 3.6e-15; 

Matches 103; Conservative 64; Mismatches 123; Indels 85; Gaps 



15; 



Qy 



Db 



92 YYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDT YFDTERSSTYRSKGFDV 145 

I I :: I I I I I I :: I I I I I I I : I : : : : : I I I I I 

79 YYGDIGIGTPPQCFTWFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSSTYVKNGTSF 138 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



14 6 TVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIAT IFESENFFLPGI 



KWNGIL 197 



: I I I : I : : : I I : : 



: I 



I 



I I 



139 DIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIF-GEATKQPGIVFVA7VKFDGIL 197 
198 G LAYAT LAK PSSSLETFFDS LVT QAN I - PN VFSMQMC GAGL P VAG S GTN GG S LVL GG I EP 256 



I : I 



I I : I : I 



I : I I 



198 GMGYPHIS — WNVLPVFDNLMQQKLVDKNIFSFY- 



I I llhlll: 

LNRDPEGQPGGELMLGGTDS 250 



257 SLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVF 316 



I I : : I : : I : I : : : I I : I : I I I : 



: I I I I : I I : I I I : 



251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNE-LTL-CK — GGCEAIVDTGTSLLVGPVEEV 306 
317 DAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENSSRSFRITIL 376 



: : I : II 

307 KELQKAIGAVPLI 



* I * * * I 

* I * * * I 

■QGEYMIPCEKVSSL 333 



377 PQLYIQPMMGAGLNYEC YRFGIS PSTNALVIGATVMEG 414 

I :|:: :| I I I I I :| II ::| : 

334 PTVYLK— LG-GKNYELHPDKYILKVSQGGKTICLSGFMGMDIPPPSGPLWILGDVFIGS 390 

415 FYVI FDRAQKRVGFA 42 9 

: I : I I I I I I I I 

391 YYTVFDRDNNRVGFA 405 



Search completed: February 28, 2004, 09:11:59 
Job time : 19.5 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on : 



Title: 
Perfect score: 2687 



February 28, 2004, 08:57:15 ; Search time 11.7727 Seconds 

(without alignments ) 
2291.087 Million cell updates/sec 

US-09-668-314C-2 



Sequence : 



1 MGALARALLLPLLAQWLLRA RPRDPEWNDESSLVRHRWK 518 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 141681 seqs, 52070155 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



141681 



Database : 



SwissProt 42:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
BAE2_HUMAN 

ID BAE2_HUMAN STANDARD; PRT; 518 AA. 

AC Q9Y5Z0; Q9UJT6; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Beta secretase 2 precursor (EC 3.4.23.45) (Beta-site APP-cleaving 

DE enzyme 2) (Aspartyl protease 1) (Asp 1) (ASP1) (Membrane-associated 

DE aspartic protease 1) (Memapsin-1) (Down region aspartic protease) . 

GN BACE2 OR ASP21. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=20057170; PubMed=105912 13 ; 

RA Yan R. , Bienkowski M.J., Shuck M.E., Miao H., Tory M.C., Pauley A.M., 

RA Brashier J.R., Stratman N.C., Mathews W.R., Buhl A.E., Carter D.B., 

RA Tomasselli A.G., Parodi L.A., Heinrikson R.L., Gurney M.E.; 

RT "Membrane-anchored aspartyl protease with Alzheimer 1 s disease 

RT beta-secretase activity."; 

RL Nature 402:533-537(1999). 



RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Bone marrow; 

RA Xin H., Stephans J.C., Duan X., Harrowe G., Kim E., Grieshammer U., 

RA Giese K. ; 

RT "Identification of a novel aspartic-like protease differentially 

RT expressed in human breast cancer cell lines."; 

RL Submitted (JAN-1999) to the EMBL/GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RA Accarino M.P., Fumagalli P., Ottolenghi S., Taramelli R. ; 

RT "Cloning of a gene from chromosome 21 Down region encoding a potential 

RT transmembrane aspartyl protease."; 

RL Submitted (FEB-1998) to the EMBL/GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RA Solans A., Estivill X., de la Luna S.; 

RT "Cloning of a novel mammalian aspartyl protease."; 

RL Submitted (AUG-1999) to the EMBL/GenBank/DDBJ databases. 

RN [5] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=2 0120043; PubMed= 10656250; 

RA Hussain I., Powell D.J., Howlett D.R., Tew D.G., Meek T.D., 

RA Chapman C, Gloger I.S., Murphy K.E., Southan CD., Ryan D.M., 

RA Smith T.S., Simmons D.L., Walsh F.S., Dingwall C, Christie G. ; 

RT "Identification of a novel aspartic proteinase (Asp 2) as 

RT beta-secretase . " ; 

RL Mol. Cell. Neurosci. 14:419-427(1999). 

RN [6] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=20144060; PubMed=10677483; 

RA Lin X., Koelsch G., Wu S., Downs D., Dashti A. , Tang J. ; 

RT "Human aspartic protease memapsin 2 cleaves the beta-secretase site of 

RT beta-amyloid precursor protein."; 

RL Proc. Natl. Acad. Sci. U.S.A. 97:1456-1460(2000). 

RN [7] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=2 0289799; PubMed= 10830953; 

RA Hattori M., Fujiyama A., Taylor T.D., Watanabe H., Yada T., 

RA Park H.-S., Toyoda A., Ishii K. , Totoki Y. , Choi D.-K., Groner Y., 

RA Soeda E. , Ohki M. , Takagi T., Sakaki Y. , Taudien S., Blechschmidt K. f 

RA Polley A., Menzel U., Delabar J., Kumpf K. , Lehmann R., Patterson D. f 

RA Reichwald K., Rump A., Schillhabel M. r Schudy A., Zimmermann W. , 

RA Rosenthal A., Kudoh J., Shibuya K. , Kawasaki K., Asakawa S., 

RA Shintani A., Sasaki T., Nagamine K. , Mitsuyama S . , Antonarakis S.E., 

RA Minoshima S., Shimizu N., Nordsiek G. , Hornischer K., Brandt P., 

RA Scharfe M. f Schoen O., Desario A. f Reichelt J. f Kauer G. , Bloecker H. , 

RA Ramser J., Beck A., Klages S., Hennig S., Riesselmann L. , Dagand E., 

RA Wehrmeyer S. f Borzym K., Gardiner K. f Nizetic D., Francis F., 

RA Lehrach H., Reinhardt R. , Yaspo M.-L.; 

RT "The DNA sequence of human chromosome 21."; 

RL Nature 405:311-319(2000). 

RN [8] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Skin; 

RX MEDLINE-22388257; PubMed=12477932 ; 

RA Strausberg R.L., Feingold E.A. , Grouse L.H., Derge J.G., 



RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L., Marusina K., Farmer A. A., Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A. , McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J. , Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A. , Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W. , Green E.D., Dickson M.C., 

RA Rodriguez A.C. f Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A. r Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [9] 

RP CHARACTERIZATION. 

RX MEDLINE=22088158; PubMed=12 093293 ; 

RA Turner R.T. Ill, Loy J. A., Nguyen C. r Devasamudram T., Ghosh A.K., 

RA Koelsch G. , Tang J. ; 

RT "Specificity of memapsin 1 and its implications on the design of 

RT memapsin 2 (beta-secretase ) inhibitor selectivity. 11 ; 

RL Biochemistry 41:8742-8746(2002). 

CC -!- CATALYTIC ACTIVITY: Broad endopeptidase specificity. Cleaves Glu- 

CC Val-Asn-Leu- | -Asp-Ala-Glu-Phe in the Swedish variant of 

CC Alzheimer f s amyloid precursor protein. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- SIMILARITY: Belongs to peptidase family Al . 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF200342; AAF17078.1; -. 

DR EMBL; AF117892; AAD45240.1; -. 

DR EMBL; AF050171; AAD45963.1; 

DR EMBL; AF178532; AAF29494.1; -. 

DR EMBL; AF204944; AAF26368.1; -. 

DR EMBL; AF200192; AAF13714.1; 

DR EMBL; AL163284; CAB90458.1; -. 

DR EMBL; AL163285; CAB90554.1; -. 

DR EMBL; BC014453; AAH14453.1; -. 

DR HSSP; P00797; 2 REN . 

DR MEROPS; A01.041; 

DR Genew; HGNC:934; BACE2 . 

DR MIM; 605668; -. 

DR GO; GO: 0005624; C:membrane fraction; TAS . 

DR GO; GO: 0004190; F: aspartic-type endopeptidase activity; TAS. 

DR GO; GO:0006464; P: protein modification; TAS. 



DR 


GO; GO:0009306; P: 


protein 


secretion; TAS. 


DR 


InterPro ; 


IPR001969; Aspprotease AS. 


LJ L\ 


InterPro; 


IPR009007; Pept 


A 


acid. 


DR 


InterPro; 


IPR001461; Peptidase Al . 


DR 


Pfam; PF00026; asp 


• i 

f -L » 






DR 


PROSTTF, • 


PS00141; 


ASP_PROTEASE; 2. 


rvw 


R\/H T"ol 3 <Z (=i 


; Aspartyl protease 


; Glycoprotein; Zymogen; Transmembrane; 


"KTAT 
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BETA SECRETASE 2. 
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DOMAIN 


21 
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EXTRACELLULAR (POTENTIAL) . 


FT 
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POTENTIAL. 


FT 
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CYTOPLASMIC (POTENTIAL) . 


FT 


ACT SITE 


110 


110 




BY SIMILARITY. 


FT 


ACT SITE 


303 


303 




BY SIMILARITY. 


FT 


CARBOHYD 


170 


170 




N-LINKED ( GLCNAC . . .) (POTENTIAL). 


FT 


CARBOHYD 


366 


366 




N-LINKED (GLCNAC. . .) (POTENTIAL) . 


FT 


CONFLICT 


36 
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A -> T (IN REF. 6) . 


SQ 


SEQUENCE 


518 AA; 


56180 


MW; 2E903150823760D3 CRC64; 


Query Match 




100. 


0%; 


Score 2687; DB 1; Length 518; 


Best Local Similarity 


100. 


0%; 


Pred. No. 2e-187; 



Matches 518; Conservative 



0; Mismatches 



0; Indels 



0; Gaps 



0; 
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Db 
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1 MGALARALLLPL]^QWLLRAAPE]^PAPFTLPLRV7y\ATNRVVAPTPGPGTPAERHADGL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I 

1 MGALARAL L L P L LAQWL L RAAP E LAP AP FT L P LRVAAATN RWAP T P G P GT P AERHAD GL 60 

61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I II I I I I I I I II I I II I I I I I 

61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTI PKGFNTSFLVNIATI 180 

I I I I I I M I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | M | | | | | | 
121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTI PKGFNTSFLVNIATI 180 

181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I M I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I M | | | | | | | | 
181 FESENFFLPGIKWNGILGLAYATL7VKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYN.ADKA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 



301 



360 



IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I 

301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTN7VLVIGATVMEGFYVIFD 42 0 

421 RAQKRVGFAAS P CAE I AGAAVS E I S G P FS T EDVASNCVPAQS L S E P I LWI VS YALMS VCG 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

421 RAQKRVGFAAS PC7VEI AGAAVS EI SGPFSTEDVASNCVPAQSLSEPI LWI VS YALMS VCG 480 



Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

Db 481 AI L L VL I VL L L L P FRCQRR P RD P E WN D E S S L VRH RWK 518 



RESULT 2 
BACE RAT 



ID BACE_RAT STANDARD; PRT; 501 AA. 

AC P56819; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last' annotation update) 

DE Beta-secretase precursor (EC 3.4.23.46) (Beta-site APP cleaving 

DE enzyme) (Beta-site amyloid precursor protein cleaving enzyme) 

DE (Aspartyl protease 2) (Asp 2) (ASP2) (Membrane-associated aspartic 

DE protease 2) (Memapsin-2 ) . 

GN BACE . 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=20002 972; PubMed=10531052 ; 

RA Vassar R., Bennett B.D., Babu-Khan S., Kahn S., Mendiaz E.A., 

RA Denis P., Teplow D.B., Ross S., Amarante P., Loeloff R. , Luo Y., 

RA Fisher S., Fuller J-, Edenson S., Lile J., Jarosinski M.A., 

RA Biere A.L., Curran E., Burgess T., Louis J.-C, Collins F., 

RA Treanor J., Rogers G., Citron M. ; 

RT "Beta-secretase cleavage of Alzheimer 1 s amyloid precursor protein by 

RT the transmembrane aspartic protease BACE."; 

RL Science 28 6:7 35-741(1999). 

CC -!- FUNCTION: Responsible for the proteolytic processing of the 

CC amyloid precursor protein (APP) . Cleaves at the amino terminus of 

CC the A-beta peptide sequence, between residues 671 and 672 of APP, 

CC leads to the generation and extracellular release of beta-cleaved 

CC soluble APP, and a corresponding cell-associated carboxy-terminal 

CC fragment which is later release by gamma-secretase (By 

CC similarity) . 

CC -!- CATALYTIC ACTIVITY: Broad endopeptidase specificity. Cleaves Glu- 

CC Val-Asn-Leu- | -Asp-Ala-Glu-Phe in the Swedish variant of 

CC Alzheimer's amyloid precursor protein. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- SIMILARITY: Belongs to peptidase family Al . 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF190727; AAF04144.1; -. 

DR HSSP; P32329; 1YPS. 

DR MEROPS; A01.004; -. 

DR InterPro; IPR001969; Aspprotease AS. 



DR 
DR 
DR 
DR 
DR 
KW 
KW 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



InterPro; IPR009007; Pept_A_acid. 
InterPro; IPR001461; Peptidase_Al . 
Pfam; PF00026; asp; 1. 
PRINTS ; PR00792; PEPSIN. 
PROSITE; PS00141; ASP_PROTEASE ; 1. 

Hydrolase; Aspartyl protease; Glycoprotein; Zymogen; Transmembrane; 
Signal . 



SIGNAL 

PROPEP 

CHAIN 

DOMAIN 

TRANSMEM 

DOMAIN 

ACT_SITE 

ACT_SITE 

DISULFID 

DISULFID 

DISULFID 

CARBOHYD 

CARBOHYD 

CARBOHYD 

CARBOHYD 

SEQUENCE 



1 

22 

46 

22 
458 
479 

93 
289 
216 
278 
330 
153 
172 
223 
354 
501 AA; 



21 

45 
501 
457 
478 
501 

93 
289 
420 
443 
380 
153 
172 
223 
354 

55806 



POTENTIAL . 

POTENTIAL. 

BETA-SECRETASE. 

EXTRACELLULAR (POTENTIAL) 

POTENTIAL. 

CYTOPLASMIC (POTENTIAL) . 
BY SIMILARITY. 
SIMILARITY. 
SIMILARITY. 
SIMILARITY. 
SIMILARITY. 
N-LINKED (GLCNAC. . 
N-LINKED (GLCNAC. . 
N-LINKED (GLCNAC. . 
N-LINKED (GLCNAC. . 



BY 
BY 
BY 
BY 



(POTENTIAL) 
(POTENTIAL) 
(POTENTIAL) 
(POTENTIAL) 



MW; 24B445BC8BE87DE3 CRC64; 



Query Match 44.2%; Score 1187; DB 1; Length 501; 

Best Local Similarity 46.4%; Pred. No. 1.2e-78; 

Matches 24 0; Conservative 82; Mismatches 165; Indels 30; 



Gaps 



9; 



QY 
Db 



9 L L P L LAQ WL L RAAP E LAP AP FT LPLRVAAATNRWAPTPGP — GTPAERHADGLA 61 

: I I II : I I I I I I I I III II 

1 MAPALRWLLLWVGSGMLPAQGTHLGIRLPLRSGLA GPPLGLRLPRETDE — 49 



QY 
Db 

QY 
Db 

QY 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



62 LALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGT 121 



I I 



I 



I I I I I : I I I : I I I : I I : I : I I I I I I I I I I I I I II I 



50 EP — E E P GRRG S FVEMVDN L RG K S GQ G Y YVEMT VG S P PQT LN I L VDT G S S N FAVGAA 104 

122 PHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIF 181 

II:: I : : I I I I I I I I I I I I I : I I I I : I I I I : I II I 

105 PHPFLHRYYQRQLSSTYRDLRKSVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAIT 164 

182 ESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPV — 239 

II: II: I I I I I I I I I I : I : I I II I I I I I I I : I I I : I I : I : I I I I I : 
165 ESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHIPNIFSLQLCGAGFPLNQ 224 

240 -AGSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNAD 298 



I I 



I 



III I : I I I I I : I I I I 



II II I : I I : I I I I 



225 TEALASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYD 284 

299 KAIVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKI 358 

I : I I I I I I I I I I I : I I I : I I : : : II : I I I I I I I I I I I I I : II I 

285 KS I VDSGTTNLRLPKKVFEAAVKS I KAAS STEKFPDGFWLGEQLVCWQAGTTPWNI FPVI 344 

359 S I YLRDENS S RS FRI T I LPQLYI QPMMGAGLN Y-EC YRFGI S PSTNALVT GATVMEGFYV 417 



I : I I I ::: I I I I I I I I I I : : I : 



I I 



I I 



I : I I : I I I I I I 



345 SLYLMGEVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGTVMGAVIMEGFYV 4 04 



Qy 



418 



I FD RAQ K RVG FAAS P CAE I AGAAVS E I S G P F S T EDVAS N C VPAQ S L S E P I LW I VS YALMS 477 



Db 4 05 VFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTADMEDCGYNIPQTDESTLMTIAYVMAA 4 64 



Qy 478 VCGAILLVLIVLLLLPFRCQR — RPRDPEWNDESSL 512 

: I I : : : : I : : : I I I I : : : I I I 

Db 4 65 IC-ALFMLPLCLMVCQWRCLRCLRHQHDDFADDISLL 500 



RESULT 3 
BACE MOUSE 



ID BACE_MOUSE STANDARD; PRT; 501 AA. 

AC P56818; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Beta-secretase precursor (EC 3.4.23.46) (Beta-site APP cleaving 

DE enzyme) (Beta-site amyloid precursor protein cleaving enzyme) 

DE (Aspartyl protease 2) (Asp 2) (ASP2) (Membrane-associated aspartic 

DE protease 2) (Memapsin-2) . 

GN BACE * 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=20002972; PubMed=10531052 ; 

RA Vassar R., Bennett B.D., Babu-Khan S., Kahn S., Mendiaz E.A. , 

RA Denis P., Teplow D.B., Ross S., Amarante P., Loeloff R. , Luo Y., 

RA Fisher S., Fuller J., Edenson S., Lile J., Jarosinski M.A. , 

RA Biere A.L., Curran E. , Burgess T., Louis J.-C, Collins F. , 

RA Treanor J. , Rogers G. , Citron M. ; 

RT "Beta-secretase cleavage of Alzheimer 1 s amyloid precursor protein by 

RT the transmembrane aspartic protease BACE."; 

RL Science 286:735-741(1999). 

RN [2] 

RP REVISIONS TO 6 AND 81-87. 

RA Bennett B.D., Vassar R. , Citron M. ; 

RL Submitted (JAN-2000) to the EMBL/ GenBank/ DDB J databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=2005717 0; PubMed=10591213 ; 

RA Yan R., Bienkowski M.J., Shuck M.E., Miao H., Tory M.C., Pauley A.M., 

RA Brashier J.R., Stratman N.C., Mathews W.R., Buhl A.E., Carter D.B., 

RA Tomasselli A.G., Parodi L.A., Heinrikson R.L., Gurney M.E.; 

RT "Membrane-anchored aspartyl protease with Alzheimer* s disease 

RT beta-secretase activity."; 

RL Nature 402:533-537(1999). 

CC -!- FUNCTION: Responsible for the proteolytic processing of the 

CC amyloid precursor protein (APP) . Cleaves at the amino terminus of 

CC the A-beta peptide sequence, between residues 671 and 672 of APP, 

CC leads to the generation and extracellular release of beta-cleaved 

CC soluble APP, and a corresponding cell-associated carboxy-terminal 

CC fragment which is later release by gamma-secretase (By 

CC similarity) . 

CC -!- CATALYTIC ACTIVITY: Broad endopeptidase specificity. Cleaves Glu- 
CC Val-Asn-Leu- | -Asp-Ala-Glu-Phe in the Swedish variant of 



cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
KW 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



Alzheimer's amyloid precursor protein. 

- SUBCELLULAR LOCATION: Type I membrane protein 

- TISSUE SPECIFICITY: Brain. 

- SIMILARITY: Belongs to peptidase family Al . 



This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 



EMBL; AF190726; AAF04143.2; 

EMBL; AF200346; AAF17082.1; -. 

HSSP; P56272; 1AM5 . 

MEROPS; A01.004; -. 

MGD; MGI: 1346542; Bace. 

InterPro; IPR001969; Aspprotease_AS . 

InterPro; IPR009007; Pept_A_acid. 

InterPro; IPR001461; Peptidase_Al . 

Pfam; PF00026; asp; 1. 

PRINTS; PR00792; PEPSIN. 

PROSITE; PS00141; ASP_PROTEASE; 1. 

Hydrolase; Aspartyl protease; Glycoprotein; Zymogen; Transmembrane; 
Signal . 



SIGNAL 

PROPEP 

CHAIN 

DOMAIN 

TRANSMEM 

DOMAIN 

ACTJ3ITE 

ACT_SITE 

DISULFID 

DISULFID 

DISULFID 

CARBOHYD 

CARBOHYD 

CARBOHYD 

CARBOHYD 

SEQUENCE 



1 

22 

46 

22 
458 
479 

93 
289 
216 
278 
330 
153 
172 
223 
354 
501 AA; 



21 
45 
501 
457 
478 
501 
93 
289 
420 
443 
380 
153 
172 
223 
354 



POTENTIAL . 

POTENTIAL. 

BETA- S ECRETAS E . 

EXTRACELLULAR (POTENTIAL) . 

POTENTIAL. 

CYTOPLASMIC (POTENTIAL) . 
BY SIMILARITY. 
SIMILARITY. 
SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
N-LINKED ( GLCNAC . . 
N-LINKED (GLCNAC. . 
N-LINKED (GLCNAC. . 
N-LINKED (GLCNAC. . 



BY 
BY 



(POTENTIAL) 
(POTENTIAL) 
(POTENTIAL) 
(POTENTIAL) 



55747 MW; C085A013145E474E CRC64; 



Query Match 44.1%; Score 1185; DB 1; Length 501; 

Best Local Similarity 46.0%; Pred. No. 1.7e-78; 

Matches 237; Conservative 83; Mismatches 169; Indels 26; Gaps 



7; 



Qy 

Db 



9 LLPLLAQWLLRAAPELAPAPFT 



I I 



I I 



LPLRVAAATNRWAPTPGPGTPAERHADGLALA 63 
I I I I I II II : 



1 MAPALHWLLLWVGSGMLPAQGTHLGIRLPLRSGLA GPPLGLRLPRETDEES 51 



Qy 

Db 



64 LEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPH 123 

I : I : I I I I I : I I I : I I I : I I : I : I I I I I I I II I I I I I I I I I 

52 EEPGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVGAAPH 106 



Qy 



124 SYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFES 183 



I I I 



I I I I I I I I : I I I I : I I I I 



III I II 



Db 



107 PFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNA/TVRANIAAITES 166 



Qy 184 ENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPV A 240 

: I I : I I I I I I I I I I : I : I I II I I I I I I I : I I I : I I : I : I I I I I : 

Db 167 DKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHIPNIFSLQLCGAGFPLNQTE 226 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

: I I I :: : I I I : III I : I I I I I : I I I I :: I : :: I I I I I : I I : I I I I I : 

Db 227 ALASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKS 286 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I I I I I I I I : I I I : I I :: : II : I I I I I I I I I I I I I : I I I I : 

Db 287 IVDSGTTNLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISL 346 

Qy 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNY-ECYRFGISPSTNALVIGATVMEGFYVIF 419 

II I : : : I I I I I I I I I I : : I : : : I I : I : I I : I : I I : I II I I I : I 

Db 347 YLMGEVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGTVMGAVIMEGFYWF 406 

Qy 420 D RAQ KRVG FAAS P C AE I AGAAVS E I S G P FS T EDVASN C VP AQ S L S E P I LW I VS YALMS VC 479 

I I I : I I : I I I I I : : I I I I I : I I : : I : : : I 

Db 4 07 DRARKRIGFAVSACHVHDEFRTAAVEGPFVTADMEDCGYNIPQTDESTLMTIAYVMAAIC 466 

Qy 480 GAILLVLIVLLLLPFRCQR — RPRDPEWNDESSL 512 

I : : : : I : : : I I I I : : : I I I 

Db 4 67 -ALFMLPLCLMVCQWRCLRCLRHQHDDFADDISLL 500 



RESULT 4 
B AC E_HUMAN 

ID BACE_HUMAN STANDARD; PRT; 501 AA. 

AC P56817; Q9BYB9; Q9BYC0; Q9BYC1; Q9UJT5; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Beta-secretase precursor (EC 3.4.23.46) (Beta-site APP cleaving 

DE enzyme) (Beta-site amyloid precursor protein cleaving enzyme) 

DE (Aspartyl protease 2) (Asp 2) (ASP2) (Membrane-associated aspartic 

DE protease 2) (Memapsin-2) . 

GN BACE OR BACE1. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM A) . 

RC TISSUE=Brain; 

RX MEDLINE=20002972; PubMed=10531052 ; 

RA Vassar R. , Bennett B.D., Babu-Khan S., Kahn S., Mendiaz E.A., 

RA Denis P., Teplow D.B., Ross S., Amarante P., Loeloff R. , Luo Y., 

RA Fisher S., Fuller J., Edenson S., Lile J., Jarosinski M.A., 

RA Biere A.L., Curran E., Burgess T., Louis J.-C, Collins F. , 

RA Treanor J., Rogers G. , Citron M. ; 

RT "Beta-secretase cleavage of Alzheimer's amyloid precursor protein by 

RT the transmembrane aspartic protease BACE."; 

RL Science 286:735-741 (1999) . 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM A), SEQUENCE OF 46-68, AND 



RP CHARACTERIZATION. 

RC TISSUE=Brain; 

RX MEDLINE=2 0057171; PubMed= 10591214; 

RA Sinha S. f Anderson J. P., Barbour R. , Basi G.S., Caccavello R. , 

RA Davis D., Doan M. , Dovey H.F., Frigon N . , Hong J., Jacobson-Croak K., 

RA Jewett N. , Keim P., Knops J., Lieberburg I., Power M. , Tan H., 

RA Tatsuno G. , Tung J., Schenk D., Seubert P . , Suomensaari S.M., Wang S. f 

RA Walker D. , Zhao J., McConlogue L., Varghese J.; 

RT "Purification and cloning of amyloid precursor protein beta-secretase 

RT from human brain."; 

RL Nature 402:537-540(1999). 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORM A) . 

RX MEDLINE=20057170; PubMed-105912 13 ; 

RA Yan R., Bienkowski M.J., Shuck M.E., Miao H. , Tory M.C., Pauley A.M., 

RA Brashier J.R., Stratman N.C., Mathews W.R., Buhl A.E., Carter D.B., 

RA Tomasselli A.G., Parodi L.A., Heinrikson R.L., Gurney M.E.; 

RT "Membrane-anchored aspartyl protease with Alzheimer 1 s disease beta- 

RT secretase activity."; 

RL Nature 4 02:533-537(1999). 

RN [4] 

RP SEQUENCE FROM N.A. (ISOFORM A) . 

RX MEDLINE=2 0120043; PubMed=l 0 65 62 5 0 ; 

RA Hussain I., Powell D.J., Howlett D.R., Tew D.G., Meek T.D., 

RA Chapman C, Gloger I.S., Murphy K.E., Southan CD., Ryan D.M., 

RA Smith T.S., Simmons D.L., Walsh F.S., Dingwall C, Christie G. ; 

RT "Identification of a novel aspartic proteinase (Asp 2) as beta- 

RT secretase."; 

RL Mol. Cell. Neurosci. 14:419-427(1999). 

RN [5] 

RP SEQUENCE FROM N.A. (ISOFORM B) . 

RC TISSUE==Brain, and Pancreas; 

RA Michel B., De Pietri Tonelli D., Zacchetti D., Keller P.; 

RT "New beta-site APP cleaving enzyme isoform (BACE-1B) obtained from 

RT human brain and pancreas."; 

RL Submitted (JAN-2001) to the EMBL/GenBank/DDBJ databases. 

RN [6] 

RP SEQUENCE FROM N.A. (ISOFORM C) . 

RC TISSUE=Pancreas; 

RA Zacchetti D., De Pietri Tonelli D., Schnurbus R.; 

RT "New beta-site APP cleaving enzyme isoform (BACE-1C) obtained from 

RT human pancreas . " ; 

RL Submitted (JAN-2001) to the EMBL/GenBank/DDBJ databases. 

RN [7] 

RP SEQUENCE FROM N.A. (ISOFORMS B; C AND D) . 

RC TISSUE=Brain; 

RX MEDLINE=2 1408 467; PubMed=l 1 516562; 

RA Tanahashi H., Tabira T . ; 

RT "Three novel alternatively spliced isoforms of the human beta-site 

RT amyloid precursor protein cleaving enzyme (BACE) and their effect on 

RT amyloid beta-peptide production."; 

RL Neurosci. Lett. 307:9-12(2001). 

RN [8] 

RP SEQUENCE OF 14-501 FROM N.A. (ISOFORM A), AND CHARACTERIZATION. 

RX MEDLINE=20144060; Pub'Med=106774 83; 

RA Lin X., Koelsch G., Wu S., Downs D., Dashti A. , Tang J.; 

RT "Human aspartic protease memapsin 2 cleaves the beta-secretase site of 



RT beta-amyloid precursor protein."; 

RL Proc. Natl. Acad. Sci. U.S.A. 97:1456-1460(2000). 

RN [9] 

RP DISULFIDE BONDS. 

RX MEDLINE=2 19508 60; PubMed=11953458 ; 

RA Fischer F. , Molinari M. , Bodendorf U . , Paganetti P.; 

RT "The disulphide bonds in the catalytic domain of BACE are critical but 

RT not essential for amyloid precursor protein processing activity."; 

RL J. Neurochem. 80:1079-1088(2002). 

CC -!- FUNCTION: Responsible for the proteolytic processing of the 

CC amyloid precursor protein (APP) . Cleaves at the amino terminus of 

CC the A-beta peptide sequence, between residues 671 and 672 of APP, 

CC leads to the generation and extracellular release of beta-cleaved 

CC soluble APP, and a corresponding cell-associated carboxy-terminal 

CC fragment which is later release by gamma-secretase . 

CC -!- CATALYTIC ACTIVITY: Broad endopeptidase specificity. Cleaves Glu- 

CC Val-Asn-Leu- | -Asp-Ala-Glu-Phe in the Swedish variant of 

CC Alzheimer's amyloid precursor protein. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=4; 

CC Name=A; Synonyms=BACE-lA, BAC-501; 

CC IsoId=P56817-l; Sequence=Displayed; 

CC Name=B; Synonyms =BACE- IB, BACE-I-476; 

CC IsoId=P56817-2; Sequence=VSP_005223 ; 

CC Name=C; Synonyms=BACE-lC, BACE-I-457; 

CC IsoId=P56817-3; Sequence=VSP__005222 ; 

CC Name=D; Synonyms=BACE-lD, BACE-I-4 32; 

CC IsoId=P56817-4; Sequence=VSP_005222 , VSP_005223; 

CC -!- TISSUE SPECIFICITY: Brain. 

CC -!- SIMILARITY: Belongs to peptidase family Al . 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation — 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

DR EMBL; AF190725; AAF04142.1; 

DR EMBL; AF201468; AAF18982.1; -. 

DR EMBL; AF200343; AAF17079.1; -. 

DR EMBL; AF204943; AAF26367.1; -. 

DR EMBL; AF338816; AAK38374.1; 

DR EMBL; AF338817; AAK38375.1; 

DR EMBL; AB050436; BAB40931.1; -. 

DR EMBL; AB050437; BAB40932.1; -. 

DR EMBL; AB050438; BAB40933.1; -. 

DR EMBL; AF200193; AAF13715.1; -. 

DR PIR; A59090; A59090. 

DR PDB; 1M4H; 28-AUG-02. 

DR MEROPS; A01.004; 

DR Genew; HGNC:933; BACE. 

DR MIM; 604252; -. 

DR GO; GO: 0005887; C:integral to plasma membrane; TAS . 

DR GO; GO: 0008798; F:beta-aspartyl-peptidase activity; TAS. 



DR 


GO; GO:0009405; P: 


pathogenesis; TAS . 


nn 
JJK 


GO; GO:0006508; P: 


proteolysis 


; and peptidolysis ; TAS. 


DK 


InterPro; 


IPR001969; Aspprotease_AS . 


DR 


InterPro; 


IPR009007; Pept A_acid. 


DR 


InterPro; 


IPR001461; Peptidase Al . 


DR 


Pfam; PF00026; asp 


■; 1. 




DR 


PRINTS; PR00792; PEPSIN. 




DR 


PROSITE; 


PS00141; 


ASP PROTEASE; 1. 


KW 


Hydrolase 


; Aspartyl protease; 


Glycoprotein; zymogen; i ransmemorane, 


KW 


Signal ; Alternative splicing; 


3D— structure . 


FT 


SIGNAL 


1 


2 1 


POTENTIAL . 


FT 


PROPEP 


22 


45 




FT 


CHAIN 


46 


501 


BETA- S ECRETAS E . 


FT 


DOMAIN 


22 


457 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRANSMEM 


458 


478 


POTENTIAL . 


FT 


DOMAIN 


479 


501 


CYTOPLASMIC (POTENTIAL) . 


FT 


ACT_SITE 


93 


93 


BY SIMILARITY. 


FT 


ACT_SITE 


289 


289 


BY SIMILARITY. 


FT 


DISULFID 


216 


420 




FT 


DISULFID 


278 


443 




FT 


DISULFID 


330 


380 




FT 


CARBOHYD 


153 


153 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


172 


172 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


223 


223 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


354 


354 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


VARSPLIC 


146 


189 


Missing (in isoform C and isoform D) . 


FT 








/FTId=VSP_005222 . 


FT 


VARSPLIC 


190 


214 


Missing (in isoform B and isoform D) . 


FT 








/FTId=VSP 005223. 


SQ 


SEQUENCE 


501 AA; 


55763 MW; 


377CE4C824ACEF05 CRC64; 



Query Match 43.9%; Score 1178.5; DB 1; Length 501; 

Best Local Similarity 46.2%; Pred. No. 5.1e-78; 

Matches 240; Conservative 82; Mismatches 164; Indels 33; Gaps 9; 

Qy 7 ALLLPLLAQWLLRAAPELAPAPFT LPLRVAAATNRWAPTPGPGTPAERHADGLA 61 

I I I I I : : I I I I I I I II M 

Db 2 AQALPWLLLWM GAGVLPAHGTQHGIRLPLRSGLG GAPL GLR 42 

Qy 62 LALE — PALAS PAGAANFLAMVDNLQGDS GRGYYLEMLI GT P PQKLQI LVDTGS SN FAVA 119 

I I I : I : I I I II : I I I : II I : I I : I : I I I I I I I I I I I I I I I I 

Db 43 LPRETDEEPEEPGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVG 102 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 



120 GTPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIAT 179 

II:: I : : I I I I I I I I \ I I I I : I I I I : I I I I : III 

103 TVAPHPFLHRYYQRQLSSTYRDLRKGVTVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAA 162 

18 0 IFESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPV 239 

I II: II: I I I I I I I I I I : I : I III I I I II I I : : I I : I I : I : I I I I I : 
163 ITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHVPNLFSLQLCGAGFPL 222 

240 AGS GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYN 2 96 

I : I I I : : : I I I : III I : I I I I I : MM:: | : : : | | I I I : II : I I I 

223 NQSEVLASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYN 282 

297 ADKAIVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFP 356 
I I : I I II I I I I M I : I II : I I : : : II : I I I I I I I I I I I I I : I I 



Db 



283 



YDKS I VDSGTTNLRLPKKVFEAAVKS I KAAS STEKFPDGFWLGEQLVCWQAGTTPWNI FP 342 



Qy 357 KI S I YLRDENS SRS FRITI LPQLYIQPMMGAGLN Y-ECYRFGI S PSTNALVIGATVMEGF 415 

I I : I I I : : : I I I I II I I I I : : I : : : I I : I II I : I : I I : I I I I 

Db 343 VI S LYLMGEVTNQ S FRI T I LPQQ YLRPVEDVAT SQDDC YKFAI SQS STGTVMGAVIMEGF 402 

Qy 416 YVI FD RAQ KRVG FAAS P C AE I AGAAVS EISGPFSTE DVAS NCVP AQ S L S E P I LWI VS YAL 475 

I I : I I I I : I I : I I I I I : : I I I I I : I I : : | : 

Db 403 YWFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTLDMEDCGYNIPQTDESTLMTIAYVM 462 

Qy 476 MSVCGAI LLVLI VLLLLPFRCQR — RPRDPEWNDESSL 512 

: : I I : : : : I : : : I I I I : : : I I I 

Db 4 63 AAIC-ALFMLPLCLMVCQWRCLRCLRQQHDDFADDISLL 500 



RESULT 5 
PEPC_MACFU 

ID PEPC_MACFU STANDARD; PRT; 377 AA. 

AC P03955; 

DT 23-OCT-1986 (Rel. 02, Created) 

DT 01-AUG-1992 (Rel. 23, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Gastricsin precursor (EC 3.4.23.3) (Pepsinogen C) (Fragment). 

GN PGC . 

OS Macaca fuscata fuscata (Japanese macaque) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Macaca. 

OX NCBI_TaxID=9543; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE^Gastric mucosa; 

RX MEDLINE=92037645; PubMed=1935977 ; 

RA Kageyama T., Tanabe K. , Koiwai O. ; 

RT "Development-dependent expression of isozymogens of monkey 

RT pepsinogens and structural differences between them."; 

RL Eur. J. Biochem. 202:205-215(1991). 

RN [2] 

RP SEQUENCE OF 6-377. 

RX MEDLINE=86168133; PubMed=3514 597 ; 

RA Kageyama T., Takahashi K. ; 

RT "The complete amino acid sequence of monkey progastricsin . " ; 

RL J. Biol. Chem. 261:4406-4419(1986). 

RN [3] 

RP SEQUENCE OF 6-65. 

RX MEDLINE=85289106; PubMed-392 8 607 ; 

RA Kageyama T., Takahashi K. ; 

RT "Monkey pepsinogens and pepsins. VII. Analysis of the activation 

RT process and determination of the NH2-terminal 60-residue sequence of 

RT Japanese monkey progastricsin, and molecular evolution of 

RT pepsinogens."; 

RL J. Biochem. 97:1235-12 4 6(198 5). 

CC -!- CATALYTIC ACTIVITY: More restricted specificity than pepsin A, but 
CC shows preferential cleavage at Tyr- | -Xaa bonds; high activity 

CC towards hemoglobin as substrate. 

CC -!- PTM: Each pepsinogen is converted to corresponding pepsin at pH 
CC 2.0 in part as a result of the release of a 47 aa activation 
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cc 
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cc 
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cc 
cc 
cc 
cc 
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DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 



- ) - 



_ i - 



segment and in part as a result of stepwise proteolytic cleavage 
via an intermediate form(s) . 

MISCELLANEOUS: The expression of pepsinogen genes is regulated by 
hormones and related substances. 
SIMILARITY: Belongs to peptidase family Al. 



This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 



EMBL; X59754; CAA42426.1; 
PIR; S19683; PEMQCJ. 
HSSP; P20142; 1AVF. 
MEROPS; A01.003; -. 

InterPro; IPR001969; Aspprotease_AS . 

InterPro; IPR009007; Pept_A_acid. 

InterPro; IPR001461; Peptidase_Al . 

Pfam; PF00026; asp; 1. 

PRINTS; PR00792; PEPSIN. 

PROSITE; PS00141; ASP PROTEASE; 2. 



KW 


Hydrolase ; 


Aspartyl 


. protease 


; Zymogen; Digestion; 


Signal 


FT 


NON_TER 


1 


1 






FT 


SIGNAL 


<1 


5 






FT 


PROPEP 


6 


31 


ACTIVATION PEPTIDE. 




FT 


PROPEP 


32 


48 


ACTIVATION PEPTIDE. 




FT 


CHAIN 


49 


377 


GASTRICSIN. 




FT 


DISULFID 


93 


98 






FT 


DISULFID 


256 


260 






FT 


DISULFID 


299 


332 






FT 


ACTJSITE 


80 


80 






FT 


ACT SITE 


265 


265 






FT 


CONFLICT 


331 


331 


Y -> V (IN REF. 2) . 




FT 


CONFLICT 


349 


349 


L -> LVY (IN REF. 2) 


* 


SQ 


SEQUENCE 


377 AA; 


41148 MW 


; 2CFB8F8BF2 6D77CE CRC64; 


Query Match 




13.5%; 


Score 363.5; DB 1; 


Length 



Best Local Similarity 28.9%; Pred. No. 4.5e-19; 
Matches 118; Conservative 65; Mismatches 118; 



Indels 107; Gaps 19; 



Qy 

Db 



56 HADGLALALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSN 115 

I I : : : I I : I : I I : I : I I I I I I : I I II I I I 

44 HFGDLSVSYEP MAYMD AAYFGEI S I GTP PQNFLVLFDTGS SN 85 



Qy 

Db 

Qy 

Db 

Qy 



116 FAV AGTPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPK 167 

I I I I I I : I I I I : I ::: I I I I II I I : I : 

86 LWVPSVYCQSQACTSHS RFNPSESSTYSTNGQTFSLQYGSGSLTGFFGYDTLTV — 139 

168 GFNTSFLVNIATIFESENFFLPG IKWNGILGLAYATLAKPSSSLETFFDSLVTQA 222 

I I III II : : : | | : | | | | | | : :: | : | : 

140 QSIQVPNQEFGLSEN — EPGTNFVYAQFDGIMGLAYPTLSVDGAT — TAMQGMVQEG 192 

223 NIPN-VFSMQMCGAGLPVAGSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKL 2 81 
: : : I I : : I : : I I : : I I I : : I I I I I : : I : : I I : I I I : 



Db 



193 ALTSPIFSVYLSDQ QGS S GGAWFGGVDS S LYTGQI YWAPVTQELYWQ I GI EEF 246 



Qy 282 EIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQ 341 

MM: | | : I M \ : M : I I : M : I : : M MM 
Db 247 LIGGQASGW-CSE — GCQAIVDTGTSLLTVPQQYMSALLQA TGAQ 288 

Qy 342 LACWTNSETPWSYF PKISIYLRDENS5RSFRITILPQLYIQPMMGAGLNY 391 

I : I I : : : : I M M 

Db 289 EDEYGQFLVNCNSIQNLPTLTFII NGVEFPLPPSSYI LNN 328 

Qy 392 ECY-RFGISP S T NALVI GAT VME G F YVI FD RAQ KRVG FAAS 431 

I 1:1 I : : I : : I : : I : I I I I I : 

Db 32 9 NGYCTVGVEPTYLSAQNSQPLWILGDVFLRSYYSVYDLSNNRVGFATA 37 6 



RESULT 6 
PEPC_HUMAN 

ID PEPC_HUMAN STANDARD; PRT; 388 AA. 

AC P20142; 

DT 01-FEB-1991 (Rel. 17,. Created) 

DT 01-FEB-1991 (Rel. 17, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Gastricsin precursor (EC 3.4.23.3) (Pepsinogen C) . 

GN PGC. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE^ 88087276; PubMed= 333554 9; 

RA Hayano T., Sogawa K., Ichihara Y., Fu j ii-Kuriyama Y. , Takahashi K. ; 

RT "Primary structure of human pepsinogen C gene."; 

RL J. Biol. Chem. 263:1382-1385(1988). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=8 907 9 67 9 ; PubMed=2 90 9526; 

RA Taggart R.T., Cass L.G., Mohandas T.K., Derby P., Barr P. J., Pals G. , 

RA Bell G.I. ; 

RT "Human pepsinogen C (progastricsin) . Isolation of cDNA clones, 

RT localization to chromosome 6, and sequence homology with pepsinogen 

RT A."; 

RL J. Biol. Chem. 264:375-379(1989). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Placenta; 

RX MEDLINE=8 92 9084 0; PubMed-2567697 ; 

RA Pals G., Azuma T., Mohandas T.K., Bell G.I., Bacon J., 

RA Samloff I.M., Walz D.A., Barr P.J., Taggart R.T.; 

RT "Human pepsinogen C (progastricsin) polymorphism: evidence for a 

RT single locus located at 6p2 1 . 1-pter . " ; 

RL Genomics 4 : 137-148 ( 198 9 ) ! . 

RN [4] 

RP SEQUENCE FROM N.A. 

RA Wong R.N.S., Tang J. ; 

RL Submitted (NOV-1996) to the EMBL/GenBank/DDBJ databases. 

RN [5] 



RP SEQUENCE OF 17-101. 

RX MEDLINE^ 90130402; PubMed-2 515193; 

RA Athauda S.B.P., Tanji M. r Kageyama T . f Takahashi K.; 

RT "A comparative study on the NH2-terminal amino acid sequences and 

RT some other properties of six isozymic forms of human pepsinogens and 

RT pepsins . " ; 

RL J. Biochem. 106:920-927(1989). 

RN [6] 

RP SEQUENCE OF 17-64. 

RX MEDLINE=83079318; PubMed=68 16595 ; 

RA Foltmann B., Jensen A.L.; 

RT "Human progastricsin . Analysis of intermediates during activation 

RT into gastricsin and determination of the amino acid sequence of the 

RT propart . " ; 

RL Eur. J. Biochem. 128:63-70(1982). 

RN [7] 

RP X-RAY CRYSTALLOGRAPHY (1.62 ANGSTROMS). 

RX MEDLINE-95230687; PubMed=7714902 ; 

RA Moore S.A., Sielecki A.R., Chernaia M.M. r Tarasova N.I., James M.N.G.; 

RT "Crystal and molecular structures of human progastricsin at 1.62-A 

RT resolution."; 

RL J. Mol. Biol. 247:466-485(1995). 

RN [8] 

RP X-RAY CRYSTALLOGRAPHY (2.36 ANGSTROMS). 

RX MEDLINE=98069649; PubMed-9406551 ; 

RA Khan A.R., Cherney M.M., Tarasova N.I., James M.N.; 

RT "Structural characterization of activation 'intermediate 2 f on the 

RT pathway to human gastricsin."; 

RL Nat. Struct. Biol. 4:1010-1015(1997). 

CC -!- CATALYTIC ACTIVITY: More restricted specificity than pepsin A, but 

CC shows preferential cleavage at Tyr- | -Xaa bonds; high activity 

CC towards hemoglobin as substrate. 

CC -!- SIMILARITY: Belongs to peptidase family Al . 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 
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Fl 


T_T TP T T V 

HELIX 


z o y 


9 Q £ 

z y D 


TTTTI 

FT 


fTIT TDXT 

TURN 


9 0 9 

z y / 


9 Q 1 

z y / 


TTTTI 

FT 


c m n t\ xt t\ 

5 1 RAND 


O Q Q 

z y y 


9 n n 
o U U 


i>i 1 1 

Fl 


rp T TT)TiT 

1 URN 


o UZ 


9 n 9 


r 1 


C* TP Fl "A XT T\ 

b 1 RAND 


9 A ^ 


o U o 


i_.ii 1 1 

FT 


HELIX 


o 1 U 


9 1 R 
olD 


i ii p 

FT 


O m T"> *7V XT T"\ 

STRAND 


j iy 


9 9 9 

oZ O 


TTTTI 

FT 


(TIT TT^XT 

TURN 


oZ4 


oZ 0 


1 ;im 

FT 


C* m Ti 7\ XT T"\ 

STRAND 


o o c 


9 9 n 


FT 


T TTT* T XV 

HELIX 


9 9 9 




Fl 


C* m Tl 7\ XT T\ 

b 1 RAND 


Job 


JJ O 


FT 


STRAND 




9 A C 

o4o 


I 'if i t 

FT 


STRAND 


i4 / 


OCA 


FT 


TURN 


t r r 

355 


356 


FT 


STRAND 


360 


363 


FT 


HELIX 


365 


368 


FT 


TURN 


369 


370 


FT 


STRAND 


371 


376 


FT 


TURN 


377 


380 


FT 


STRAND 


381 


388 


SQ 


SEQUENCE 


388 AA; 


42 



F862DFDC1438BB92 CRC64; 



Query Match 13.1%; Score 353; DB 1; Length 388; 

Best Local Similarity 29.1%; Pred. No. 2.7e-18; 

Matches 12 0; Conservative 65; Mismatches 12 0; Indels 108; 



Gaps 21; 



Qy 

Db 



52 PAERHADG-IJVlxALEPALASPAGAANFI^^ 110 

II:: II:: II : I : I I : I : I I I I I I : I I 

50 PAWKYRFGDLSVTYEP MAYMD AAYFGE I S I GT P PQN FLVL FD 91 



Qy 

Db 

Qy 

Db 

Qy 



111 TGSSNFAV AGTPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDL 162 

I I I II I I I II I : I I I I : I : : : I I I I I I I I 

92 TGSSNLWVPSVYCQSQACTSHS RFNPSESSTYSTNGQTFSLQYGSGSLTGFFGYDT 147 

163 VTIPKGFNTSFLVNIATIFESENFFLPG IKWNGILGLAYATLAKPSSSLETFFDS 217 

: I : I I III II : : : I I : I I I I I : : : I 

14 8 LTV QSIQVPNQEFGLSEN — EPGTNFVYAQFDGIMGLAYPALSVDEAT — TAMQG 198 

218 LVTQANIPN-VFSMQMCGAGLPVAGSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQI 276 
: | : : : | | | : : | : : | | : : | | | : : I II I I : : I : : I I : I I 



Db 



199 MVQEGALTSPVFSVYLSNQ QGS SGGAWFGGVDS SLYTGQI YWAPVTQELYWQI 252 



Qy 277 EILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGF 336 

I : MM: II : M I I : I I : I I : I I : | : : : | 
Db 253 GIEEFLIGGQASGW-CSE — GCQAI VDTGT S LLTVPQQ YMSALLQA 295 

Qy 337 WTGSQLACWTNSETPWSYF PKISIYLRDENSSRSFRITILPQLYIQPMMG 386 

MM I : I I : : : I II 
Db 296 -TGAQ EDEYGQFLVNCNSIQNLPSLTFI I NGVEFPLPPSSYI 336 

Qy 387 AGLNYECY- RFGI S P STNA LVI GATVMEGFYVI FDRAQKRVGFAAS 431 

I : I I : I II : M : : I : : I Mill: 

Db 337 — LSNNGYCTVGVEPTYLSSQNGQPLWILGDVFLRSYYSVYDLGNNRVGFATA 387 



RESULT 7 
PEPC_CALJA 

ID PEPC_CALJA STANDARD; PRT; 388 AA. 

AC Q9N2D3; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Gastricsin precursor (EC 3.4.23.3) (Pepsinogen C) . 

GN PGC . 

OS Callithrix jacchus (Common marmoset) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Platyrrhini; Callitrichidae; 

OC Callithrix. 

OX NCBI_TaxID=94 83 ; 

RN [1] 

RP SEQUENCE FROM N.A., SEQUENCE OF 17-26, FUNCTION, AND ENZYME 

RP REGULATION. 

RC TISSUE=Gastric mucosa; 

RX MEDLINE=20250834; PubMed-10788784 ; 

RA Kageyama T . ; 

RT "New World monkey pepsinogens A and C, and prochymosins . Purification, 
RT characterization of enzymatic properties, cDNA cloning, and molecular 
RT evolution."; 

RL J. Biochem. 127:761-770(2000). 

CC -!- FUNCTION: Hydrolyzes a variety of proteins. 

CC -!- CATALYTIC ACTIVITY: More restricted specificity than pepsin A, but 
CC shows preferential cleavage at Tyr- | -Xaa bonds; high activity 

CC towards hemoglobin as substrate. 

- ENZYME REGULATION: Inhibited by pepstatin. 

- MISCELLANEOUS: The optimal pH is around 2. 

- SIMILARITY: Belongs to peptidase family Al . 



CC 
CC 
CC 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AB038385; BAA90872.1; 

DR PIR; JC7246; JC7246. 



DR 


HSSP; P20142; 1AVF. 








DR 


MEROPS; A01.003; 








DR 


InterPro; 


IPR001969; Aspprotease AS. 


DR 


InterPro ; 


IPR009007; Pept_ 


A acid. 


DR 


InterPro; 


IPR001461; Peptidase Al . 


DR 

X-/ x \ 


Pfam; PF00026; asp; 


1. 






DR 


PRINTS; PR00792; PEPSIN. 






DR 


PROSITE; 


PS00141; ASP_PROTEASE; 2. 


I\ vv 


Hydrolase 


; Aspartyl 


protease; 


Zymogen; Digestion 


FT 


SIGNAL 


1 


16 






r 1 


PROPEP 


17 


59 




ACTIVATION PEPTIDE 


FT 


CHAIN 


60 


388 




GASTRICSIN. 


FT 


ACT_SITE 


91 


91 




BY SIMILARITY. 


FT 


ACT_SITE 


276 


276 




BY SIMILARITY. 


FT 


DISULFID 


104 


109 




BY SIMILARITY. 


FT 


DISULFID 


267 


271 




BY SIMILARITY. 


FT 


DISULFID 


310 


343 




BY SIMILARITY. 


SQ 


SEQUENCE 


388 AA; 


42503 


MW; 


0BC48DBD1F7D2D8C - 



Query Match 13.1%; Score 351.5; DB 1; Length 388; 

Best Local Similarity 30.1%; Pred. No. 3.5e-18; 

Matches 112; Conservative 56; Mismatches 115; Indels 89; Gaps 17; 

Qy 92 YYLEMLI GT P PQKLQI LVDTGS SN FAV AGTPHSYIDTYFDTERSSTYRSKGF 143 

I : I • I I I I I I : I I I I I I I I I I I I I : I I I I I I 

Db 73 YFGEI S I GTP PQNFLVLFDTGS SNLWVPSVYCQSQACTSHS RFNPSASSTYSSNGQ 128 

Qy 144 DVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPG IKWNGILG 198 

: : : I I I I I I I I : I : I I III I I : : : | | : | 

Db 129 TFSLQYGSGSLTGFFGYDTLTV QSIQVPNQEFGLSEN — EPGTNFVYAQFDGIMG 181 

Qy 199 LAYATLAKPSSSLETFFDSLVTQANIPN-VFSMQMCGAGLPVAGSGTNGGSLVLGGIEPS 257 

III I : : : I : : : : : I I I : I : : I I : : : I I : : | 

Db 182 LAY PAL SMGGAT — TAMQGMLQEGALT S PVFS FYLSNQ QGSSGGAVIFGGVDSS 233 



Qy 258 LYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFD 317 

I I I I : : I : : I I : II I : MM: II : I I I I : I I : I I : I I : 

Db 234 LYTGQIYWAPVTQELYWQIGIEEFLIGGQASGW-CSE — GCQAIVDTGTSLLTVPQQYMS 290 

Qy 318 AWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYF PKISIYLRDENS 367 

I : I I I I : I I : I I :: : 
Db 291 AFLEA TGAQ EDEYGQFLVNCDSIQNLPTLTFII 323 

Qy 368 SRSFRITILPQLYIQPMMGAGLNYECY-RFGISP STNALVT GATVMEGFYVI F 419 

: I I I I : I I : I I : : I : : I : | 

Db 324 -NGVEFPLPPSSYI LSNNGYCTVGVEPTYLSSQNSQPLWILGDVFLRSYYSVF 375 

Qy 42 0 DRAQKRVGFAAS 431 

I Mill: 

Db 376 D L GNN RVG FATA 387 



RESULT 8 
PEPC_CAVPO 

ID PEPC_CAVPO STANDARD; PRT; 394 AA. 

AC Q64411; 

DT 15-JUL-1998 (Rel. 36, Created) 



DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Gastricsin precursor (EC 3.4.23.3) (Pepsinogen C) . 

GN PGC. 

OS Cavia porcellus (Guinea pig) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Hystricognathi ; Caviidae; Cavia. 

OX NCBI_TaxID=10141; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=92 355614; PubMed= 1644 82 9; 

RA Kageyama T., Ichinose M. , Tsukada S., Miki K., Kurokawa K. , Koiwai O., 

RA Tanji M. , Yakabe E., Athauda S.B., Takahashi K. ; 

RT "Gastric procathepsin E and progastricsin from guinea pig. 

RT Purification, molecular cloning of cDNAs, and characterization of 

RT enzymatic properties, with special reference to procathepsin E."; 

RL J. Biol. Chem. 267:16450-16459(1992). 

CC -!- CATALYTIC ACTIVITY: More restricted specificity than pepsin A, but 

CC shows preferential cleavage at Tyr- | -Xaa bonds; high activity 

CC towards hemoglobin as substrate. 

CC -!- SIMILARITY: Belongs to peptidase family Al . 

CC ■ : 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; M88652; AAA37053.1; -. 

DR PIR; B43356; B43356. 

DR HSSP; P20142; 1AVF. 

DR MEROPS; A01.003; -. 

DR InterPro; IPR001969; Aspprotease_AS . 

DR InterPro; IPR009007; Pept_A_acid. 

DR InterPro; IPR001461; Peptidase_Al . 

DR Pfam; PF00026; asp; 1. 

DR PRINTS; PR00792; PEPSIN. 

DR PROSITE; PS00141; ASP_PROTEASE; 2. 

KW Hydrolase; Aspartyl protease; Zymogen; Digestion; Signal. 

FT SIGNAL 1 16 POTENTIAL. 

FT PROPEP 17 65 ACTIVATION PEPTIDE. 

FT CHAIN 66 394 GASTRICSIN. 

FT ACTJSITE 97 97 BY SIMILARITY. 

FT ACT_SITE 283 283 BY SIMILARITY. 

FT DISULFID 110 115 BY SIMILARITY. 

FT DISULFID 273 277 BY SIMILARITY. 

FT DISULFID 316 349 BY SIMILARITY. 

SQ SEQUENCE 394 AA; 42995 MW; 114F08E105D4 98 65 CRC64; 

Query Match 12.1%; Score 324.5; DB 1; Length 394; 

Best Local Similarity 29.0%; Pred. No. 3.2e-16; 

Matches 107; Conservative 63; Mismatches 116; Indels 83; Gaps 18; 

Qy 92 YYLEMLIGTP PQKLQI LVDTGS SNF AVAGTPHSYIDTYFDTERSSTYRSKGF 143 

I : : : : I I I I I I : I I I I I I I : : I I I I h MM: 



Db 



79 YFGQISLGTPPQSFQVLFDTGSSNLWVPSVYCSSLACTTH TRFNPRDSSTYVATDQ 134 



Qy 144 DVTVKYTQGSWTGFVGEDLVTI PK-GFNTSFLVNIATIFESENFFLPG IK 192 

: : : I I I II I I : I I I I I I I : I II : 

Db 135 SFSLEYGTGSLTGVFGYDTMTIQDIQVPKQEFGLS ETE PGSDFVYAE 181 

Qy 193 WN G I LGLAYAT LAK P S S S LET FFD S LVT QAN I - PNVF S MQMC GAGL P VAG S — GTNGGSL 249 

:: I I I I I I I : : : : I I : : : : : I I : : II I : : I I 

Db 182 FDGILGLGYPGLSEGGAT — TAMQGLLREGALSQSLFSVYL GSQQGSDEGQL 231 

Qy 250 VLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLL 309 

: I I I : : I I I I I I : : I I : : I I : I I I I I : I : I I I : I I : I I 

Db 232 I LGGVDESLYTGDI YWTPVTQELYWQI GI EGFLI DGSASGWCSR GCQGIVDTGTSLL 288 



Qy 310 RLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENSSR 369 

: I : I : I : I : : I : : | : : | I : 

Db 289 T VP S D YL ST LVQAI GAEE — NEYGEYF VSCSSIQDLPTLTFVISGV 332 

Qy 37 0 SFRITILPQLYIQP MMGAGLN YECYRFGI S PSTN — ALVI GATVMEGFYVI FDRA 422 

: I I I I : I I : I I : : I : : I : : I I 

Db 333 — EFPLSPSAYILSGENYCMVGLESTY VSPGGGEPVWILGDVFLRSYYSVYDLA 384 



Qy 423 QKRVGFAAS 431 

I I I I I : 

Db 385 NNRVGFATA 393 



RESULT 9 




REN I 


MOUSE 




ID 


REN I MOUSE STANDARD; PRT; 402 AA. 




AC 


P06281; P97911; Q62153; Q62154; 




DT 


01-JAN-1988 (Rel. 06, Created) 




DT 


01-JAN-1988 (Rel. 06, Last sequence update) 




DT 


15-MAR-2004 {Rel. 43, Last annotation update) 




DE 


Renin 1 precursor (EC 3.4.23.15) (Angiotensinogenase) 


(Kidney renin) . 


GN 


REN1 OR REN-1 OR REN. 




OS 


Mus musculus (Mouse) . 




OC 


Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 


OC 


Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; 


Murinae; Mus. 


OX 


NCBI TaxID=10090; 




RN 


[1] 




RP 


SEQUENCE FROM N.A. 




RC 


STRAIN=BALB/c; 




RX 


MEDLINE-84 182525; PubMed=6370686; 




RA 


Holm I., Olio R. , Panthier J. -J., Rougeon F. ; 




RT 


"Evolution of aspartyl proteases by gene duplication: 


the mouse renin 


RT 


gene is organized in two homologous clusters of four 


exons . " ; 


RL 


EMBO J. 3:557-562(1984). 




RN 


[2] 




RP 


SEQUENCE FROM N.A. 




RC 


STRAIN=BALB/ c; TISSUE=Kidney ; 




RX 


MEDLINE=90067 953; PubMed=2685761 ; 




RA 


Kim W.S., Murakami K., Nakayama K. ; 




RT 


"Nucleotide sequence of a cDNA coding for mouse Renl 


preprorenin . " ; 


RL 


Nucleic Acids Res. 17:9480-9480(1989). 




RN 


[3] 




RP 


SEQUENCE FROM N.A. 





RC STRAIN=DBA/2, and C57BL/10; 

RX MEDLINE=90108722; PubMed=2691339; 

RA Burt D.W., Mullins L.J., George H., Smith G. , Brooks J., Pioli D., 

RA Brammar W.J.; 

RT "The nucleotide sequence of a mouse renin-encoding gene, Ren-ld, and 

RT its upstream region."; 

RL Gene 84:91-104(1989). 

RN [4] 

RP SEQUENCE OF 1-30 FROM N.A. 

RC TISSUE=Kidney; 

RX MEDLINE= 84298161; PubMed= 608 9205; 

RA Panthier J. -J., Dreyfus M. , Roux D.T.L., Rougeon F. ; 

RT "Mouse kidney and submaxillary gland renin genes differ in their 5 f 

RT putative regulatory sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 81:5489-5493(1984). 

RN [5] 

RP SEQUENCE OF 1-31 FROM N.A. 

RX MEDLINE=85085936; PubMed=6392850; 

RA Field L.J., Philbrick W.M., Howies P.N., Dickinson D.P., 

RA McGowan R.A., Gross K.W. ; 

RT "Expression of tissue-specific Ren-1 and Ren-2 genes of mice: 

RT comparative analysis of 5 '-proximal flanking regions."; 

RL Mol. Cell. Biol. 4:2321-2331(1984). 

RN [6] 

RP SEQUENCE OF 22-37 AND 72-80. 

RC STRAIN=C57BL/10ROS X C3H/HEROS; TISSUE=Kidney ; 

RX MEDLINE=97182599; PubMed-9030738 ; 

RA Jones C.A. r Petrovic N., Novak E.K., Swank R.T., Sigmund CD., 

RA Gross K.W. ; 

RT "Biosynthesis of renin in mouse kidney tumor As4.1 cells."; 

RL Eur. J. Biochem. 243:181-190(1997). 

CC -!- FUNCTION: Renin is a highly specific endopeptidase, whose only 
CC known function is to generate angiotensin I from angiotensinogen 

CC in the plasma, initiating a cascade of reactions that produce an 

CC elevation of blood pressure and increased sodium retention by the 

CC kidney. 

CC -!- CATALYTIC ACTIVITY: Cleaves Leu- | - bond in angiotensinogen to 

CC generate angiotensin I. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- TISSUE SPECIFICITY: Kidney. 

CC -!- INDUCTION: Renal renin is synthesized by the juxtaglomerular cells 
CC of the kidney in response to decreased blood pressure and sodium 

CC concentration. 

CC -!- POLYMORPHISM: In inbred mouse strains, there are at least two 
CC alleles which can occur at the Renl locus: Ren-ID and Ren-lC. 

CC The sequence shown is that of Ren-lC. 

CC -!- SIMILARITY: Belongs to peptidase family Al . 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; X00810; CAA25391.1; 



UK 


PiyfDT . 
XjiYlDXi / 


v n n p i i • 




i ■ 

-L r 




JJK 


JMYLrSXi r 


YfiflRI O . 
AUUol/ / 


\^£\t\£* O _7 J_ • 


1 • 

X /■ 


U W X IN Hi U ■ 


UK 


PUD T • 


v n n p i q • 
auuoio ; 




1 . 
X r 


.TOTM'P'n 

UU11N HiL> ■ 


UK 


T7TVJTT3 T • 


aU U o 14 ; 




1 . 
X f 


UU11N XjU . 


nn 

DR 








1 . 
X f 


TnTNTFFl 
uUIINIjU • 


nn 
UK 


T7 1 '|\/f"Q t . 


au uo j.o; 




1 . 

X / 




nn 
UK 


HjMdl ; 


AUUo ju; 




1 . 

X f 


.TnTMFPi 


nn 

DK 




AUUODl, 




1 . 

X / 


CjL/ . 


n n 

DR 


EMBL; 


X16642; 


CAA34636. 


-L / 




DR 


EMBL; 


K02596; 


AAA40045. 


i ; 




DR 


EMBL; 


M32352; 


AAA40043. 


1; 




DR 


EMBL; 


K02800; 


AAA40044 . 


1; 




DR 


EMBL; 


M34190; 


AAA40042. 


l; 




DR 


PIR; 


A00989; 


REMSK. 






DR 


HSSP; 


P00796; 


1SMR. 






DR 


MEROPS; A01.007; -- 






DR 


MGD; 


MGI: 97898; Renl. 







DR InterPro; IPR001969; Aspprotease_AS . 

DR InterPro; IPR009007; Pept_A_acid. 

DR InterPro; IPR001461; Peptidase_Al . 

DR Pfam; PF00026; asp; 1. 

DR PRINTS; PR00792; PEPSIN . 

DR PROSITE; PS00141; ASP_PROTEASE; 2. 

KW Hydrolase; Aspartyl protease; Plasma; Glycoprotein; Zymogen; 



KW 


Signal . 












FT 


SIGNAL 


1 


21 








FT 


PROPEP 


22 


71 


ACTIVATION PEPTIDE. 






FT 


CHAIN 


72 


402 


RENIN 1. 






FT 


ACT SITE 


102 


102 


BY SIMILARITY. 






FT 


ACT SITE 


287 


287 


BY SIMILARITY. 






FT 


DISULFID 


115 


122 


BY SIMILARITY. 






FT 


DISULFID 


278 


282 


BY SIMILARITY. 






FT 


CARBOHYD 


69 


69 


N-LINKED (GLCNAC. . 


.) 


( POTENTIAL) 


FT 


CARBOHYD 


139 


139 


N-LINKED (GLCNAC. . 


. ) 


(POTENTIAL) 


FT 


CARBOHYD 


320 


320 


N-LINKED (GLCNAC. . 


.) 


(POTENTIAL) 


FT 


VARIANT 


58 


58 


W -> R (in Ren-ID) . 






FT 


VARIANT 


68 


68 


T -> I (in Ren-ID) . 






FT 


VARIANT 


160 


160 


S -> V (in Ren-ID) . 






FT 


VARIANT 


315 


315 


E -> D (in Ren-ID) . 






FT 


VARIANT 


352 


352 


N -> Y (in Ren-ID) . 






FT 


CONFLICT 


6 


23 


MISSING (IN REF. 1) 


* 




FT 


CONFLICT 


24 


24 


T -> I (IN REF. 1) . 






FT 


CONFLICT 


163 


163 


V -> VSRV (IN REF. 


1) • 




SQ 


SEQUENCE 


4 02 AA; 


44342 


MW; D42920B555E97A38 


CRC64; 



Query Match 11.9%; Score 320; DB 1; Length 402; 

Best Local Similarity 28.6%; Pred. No. 7e-16; 

Matches 126; Conservative 66; Mismatches 181; Indels 68; Gaps 21; 

Qy 10 L P L LAQWL L RAAP E LAP AP FT L P L RVAAATN RWAP T P G- PGT PAERHAD GLALAL E 65 

: I I I I I : I I : I I I I : I I I I hi 

Db 6 MPLWALLLL WSPCTFSLPTRTATFERIPLKKMPSVREILEERGVDMTRLSAEWGV 60 



Qy 66 PA LAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAV 118 

| : Ml I : I || 111:111111 : : : I II I : I I 

Db 61 FT KRP S LTNLT S P WLTN YL NTQ YYGEIGIGTPPQTFKVI FDTGSANLWV 110 



Qy 119 AGTPHSY IDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTS 172 

| | | : : : : | | : | I I I : I I I I : : I I I : I : 

Db 111 PSTKCSRLYLACGIHSLYESSDSSSYMENGSDFTIHYGSGRVKGFLSQDSVTV-GGITVT 169 

Qy 173 FLVNIATIFESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANI-PNVFSMQ 231 

| || | : : | : | | : : I : : I I : : : I : III: 

Db 170 QTFGEVTELPLI PFML — AKFDGVLGMGFP — AQAVG GVT P VF DH I L S Q GVL KE EVF S VY 225 

Qy 232 MCGAGLPVAGSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLD 2 91 

II I I : I I I I : I I : I : I I : : M = : : I I I 

D b 226 Y NRGSHLLGGEWLGGSDPQHYQGNFHYVSISKTDSWQITMKGVSVG — SSTLL 277 

Qy 292 CREYNADKAIVDSGTTLLRLPQKVFDAWEAV-ARASLIPEFSDGFWTGSQLACWTNSET 350 

|| | : | | : | : : : I : : : I : I : I h : I I = 

Db 278 CEEGCA — VWDTGS S FI SAPTS SLKLIMQALGAKEKRI EEY WNC SQV 324 

Qy 351 PWSYFPKISIYLRDENSSRSFRITILPQLYIQPMMGAGL-NYECYRFGISPSTNAL-VIG 408 

| IN | | : : : : : I I : III : hi 

Db 325 P — TLPDISFDL GGRAYTLSSTDWLQYPNRRDKLCTLALHAMDIPPPTGPVWVLG 378 

Qy 4 09 ATVMEGFYVI FDRAQKRVGFA 42 9 

||: II III I : I I I 

Db 37 9 ATFIRKFYTEFDRHNNRIGFA 399 



RESULT 10 
CATD_CLUHA 

ID CATD_CLUHA STANDARD; PRT; 39 6 AA. 

AC Q9DEX3; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Cathepsin D precursor (EC 3.4.23.5). 

OS Clupea harengus (Atlantic herring) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Clupeomorpha ; Clupeidae; 

OC Clupea. 

OX NCBI_TaxID=7950; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Nielsen L.B., Stougaard P., Andersen P.S., Pedersen L.H.; 

RT "Cloning and sequence determination of herring muscle cathepsin D. 

RL Submitted (OCT-2000) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE OF 62-82. 

RC TISSUE=Skeletal muscle; 

RX MEDLINE=21165469; PubMed-112 074 4 7 ; 

RA Nielsen L.B., Nielsen H.H.; 

RT "Purification and characterization of cathepsin D from herring muscle 

RT (Clupea harengus)."; 

RL Comp. Biochem. Physiol. 128B : 351-363 (2001) . 

CC -!- FUNCTION: Cathepsin D is an acid protease active in intracellular 
CC protein breakdown. 

CC -!- CATALYTIC ACTIVITY: Specificity similar to, but narrower than, 
CC that of pepsin A. Does not cleave the 4-Gln- I -His-5 bond in B 

CC chain of insulin. 

CC -!- ENZYME REGULATION: Inhibited by pepstatin. 



cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 



i _ 

I _ 
i _ 



_ t _ 



SUBUNIT : Monomer . 

SUBCELLULAR LOCATION: Lysosomal. 

MISCELLANEOUS: The isoelectric point is 6.8. Has optimal activity 
at pH 2.5 with hemoglobin as the substrate and the optimal 
temperature is 37 degrees Celsius . 
SIMILARITY: Belongs to peptidase family Al . 



This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 

EMBL; AF312364; AAG27733.1; -. 
HSSP; P07339; 1LYB. 
MEROPS; A01.009; -. 

InterPro; IPR001969; Aspprotease_AS . 

InterPro; IPR009007; Pept_A_acid. 

InterPro; IPR001461; Peptidase_Al . 

Pfam; PF00026; asp; 1. 

PRINTS; PR00792; PEPSIN. 

PROSITE; PS00141; ASP_PROTEASE; 2. 

Hydrolase; Aspartyl protease; Glycoprotein; Lysosome; Signal; Zymogen. 



FT 


SIGNAL 


1 


18 




POTENTIAL. 




FT 


PROPEP 


19 


61 




ACTIVATION PEPTIDE. 




FT 


CHAIN 


62 


396 




CATHEPSIN D. 




FT 


ACT_SITE 


94 


94 




BY SIMILARITY. 




FT 


ACT_SITE 


281 


281 




BY SIMILARITY. 




FT 


DISULFID 


107 


114 




BY SIMILARITY. 




FT 


DISULFID 


272 


276 




BY SIMILARITY. 




FT 


DISULFID 


315 


352 




BY SIMILARITY. 




FT 


CARBOHYD 


131 


131 




N-LINKED (GLCNAC. . 


.) (POTENTIAL) 


SQ 


SEQUENCE 


396 AA; 


43315 MW; D0375DC38567A31B 


CRC64; 


Query Match 




11. 


7%; 


Score 315.5; DB 1; 


Length 396; 


Best Local Similarity 


27. 


1%; 


Pred. No. 1.5e-15; 




Matches 112 ; 


Conservative 




65; Mismatches 141; 


Indels 95; 



18; 



Qy 

Db 



50 GTPAERHADGLALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILV 109 

It:: I I : | : | | : : I I I :: I I I I : : 

47 GTNSLQHNQGFPSSNAP TPETLKNYM DAQ Y YGE I GLGT P VQMFT WF 93 



Qy 

Db 

QY 
Db 

Qy 

Db 



110 DT GS SN FAVAGT PH S Y I DT YFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLV 163 

I I I I I I : 1:1 : : : I I I I I : : : I I I : I : : : I 

94 DTGSSNLWLPSIHCSFTDIACLLHHKYNGAKSSTYVKNGTEFAIQYGSGSLSGYLSQDSC 153 

164 TIPKGFNTSFLVNIATIFESENFFLPGI KWNGILGLAYATLAKPSSSLETFFDSL 218 

II : I : I I II: I : : I I I I : I I : : : II : 

154 TI GDIWEKQLF-GEAI KQPGVAFIAAKFDGILGMAYPRIS — VDGVP PVFDMM 204 

219 VT QAN I - PN VFS MQMC GAG L P VAGS GTN GGS LVLGGI EPS LYKGDI WYT P I KEEW 272 

: : I : I I I I : I I I I : I I I : I III I I : : 

205 MSQKKVEQNVFS FYL NRNPDTEPGGELLLGGTDPKYYTGDFNYVPVTRQA 254 



Qy 



273 YYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFDAWEAVARASLI 329 



I : | | : : I I I I I I : : : I I I I : I I : I : I I : : I : I I 

Db 255 YWQIHMDGMSIGSQ-LTL-CKD--GCEAIVDTGTSLITGPPAEVRALQKAIGAIPLIQGE 310 

Qy 330 PEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENSSRSFRITILPQ 378 

I I I I : III I I : : : : : : I 

Db 311 YMI DCKKVPTLPT I S - - FNVGGK TYSLTGEQY VLKESQGGKTICLSGLMG 358 

Qy 379 LYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVT FDRAQKRVGFAAS 431 

Ml I : : : I : : I : I I I I I I I I I 

D b 359 LEIPP PAGPLWILGDVFIGQYYTVFDRESNRVGFAKS 395 



RESULT 11 
APR1JDRYSA 

ID APR1_0RYSA STANDARD; PRT; 509 AA. 

AC Q42456; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Aspartic proteinase oryzasin 1 precursor (EC 3.4.23.-). 

OS Oryza sativa (Rice) . 

OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta ; 

OC Spermatophyta; Magnoliophyta; Liliopsida; Poales; Poaceae; 

OC Ehrhartoideae; Oryzeae; Oryza. 

OX NCBI_TaxID=4 530; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=cv. Nipponbare / Japonica; TISSUE=Seed; 

RX MEDLINE=96 04 8 031; PubMed=7 556174; 

RA Asakura T., Watanabe H . , Abe K. , Arai S.; 

RT "Rice aspartic proteinase, oryzasin, expressed during seed ripening 

RT and germination, has a gene organization distinct from those of 

RT animal and microbial aspartic proteinases."; 

RL Eur. J. Biochem. 232:77-83(1995). 

CC -!- DEVELOPMENTAL STAGE: Seed ripening and germination. 

CC -!- SIMILARITY: Belongs to peptidase family Al . 

CC 7 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; D32165; BAA06876.1; 

DR EMBL; D32144; BAA06875.1; -. 

DR PIR; S66516; S66516. 

DR HSSP; P42210; 1QDM. 

DR MEROPS; A01.020; -. 

DR Gramene; Q42456; -. 

DR InterPro; IPR001969; Aspprotease_AS . 

DR InterPro; IPR009007; Pept_A_acid. 

DR InterPro; IPR001461; Peptidase_Al . 

DR InterPro; IPR007856; SapB_l . 

DR InterPro; IPR008138; SapB_2 . 

DR InterPro; IPR008140; SapB_sub. 



DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



InterPro; IPR008373; Saposin. 
InterPro; IPR008139; SaposinB. 
Pfam; PF00026; asp; 1. 
Pfam; PF05184; SapB_l; 1. 
Pfam; PF03489; SapB_2; 1. 
PRINTS; PR00792; PEPSIN. 
PRINTS; PR01797; SAPOSIN. 
ProDom; PD001732; SapB_sub; 1. 
SMART; SM00118; SAPB; 2. 
PROSITE; PS00141; ASP_PROTEASE; 2. 

Hydrolase; Aspartyl protease; Zymogen; Glycoprotein; Signal. 



SIGNAL 
PROPEP 
CHAIN 
DOMAIN 

ACTJSITE 
ACT_SITE 
DISULFID 
DISULFID 
DISULFID 
CARBOHYD 
CARBOHYD 
SEQUENCE 



1 
21 
68 
318 

103 
290 
116 
281 
428 
252 
400 
509 AA; 



20 
67 
509 
416 

103 
290 
122 
285 
465 
252 
400 



POTENTIAL. 
POTENTIAL. 

ASPARTIC PROTEINASE ORYZASIN 1. 
SPECIFIC TO PLANT AS PART I C PROTEINASES 
(BY SIMILARITY) . 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
N-LINKED (GLCNAC. . 
N-LINKED (GLCNAC. . 



) 



(POTENTIAL) . 
(POTENTIAL) . 



54145 MW; 182F5DADA4CBE358 CRC64; 



Query Match 11.7%; Score 313.5; DB 1; 

Best Local Similarity 23.0%; Pred. No. 2.9e-15; 
Matches 127; Conservative 75; Mismatches 179; 



Length 509; 

Indels 171; Gaps 19; 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



3 ALARALLLPLIAQWLLRTKAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHTVDGIJ^ 62 



: : I II : I I I I I : I 



I I 



I 



I I I I 



5 SVALVLLAAVLLQALLPASAEEGLVRIALKKRPIDENSRVAARLSG EEGARRLGL 59 

63 ALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLI GTPPQKLQI LVDTGS SN 115 

: I I : : | : : : I : I : : I I I I I I :: I I I I I I 

60 RGANSLGGGGGEGDIVALKNYMNAQ YFGEI GVGT P PQKFTVI FDTGS SNLWVP SAK 115 

116 — FAVAGTPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSF 173 

I : : I II : : : | | | | : I : : I II II Mil: 

116 CYFSIACFFHS RYKSGQSSTYQKNGKPAAIQYGTGSIAGFFSEDS VTVGD 165 



174 LVNIATIFESENFF LPGI 

: : : I I I : 

166 LWKDQEFIEATKEPGLTFMVAKFDGILGLGFQEISVGDA- 



KWNGILGLAYATLAKPSSSLETFFDSLVTQANI 224 
I : : I I I I I : : : : : 



-V 206 



225 PNVFSMQMCG-AGLPVAGSGTN- 
I : I I II I 



GGSLVLGGIEPSLYKGDIWYTPIKEEWYYQI 276 



I I : I I I : : I I II I : I I : 



I : I 



207 PVWYKMVEQGLVSEPVFSFWFNRHSDEGEGGEIVFGGMDPSHYKGNHTYVPVSQKGYWQF 266 



277 EILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFDAWEAVARASLIPE- 



331 



I 



I I I I I I : I I I 



267 EMGDVLIGGKTTGF-CA — SGCSAIADSGTSLLAGPTAIITEINEKIGATGWSQECKTV 323 



Qy 

Db 



332 FSDGF 336 

I : I 

324 VSQYGQQILDLLLAETQPSKICSQVGLCTFDGKHGVSAGIKSWDDEAGESNGLQSGPMC 383 



Qy 337 WTGSQLACWTNSETPWSY FPKISIYLRD 364 

I : I I I : : I I : I I : 

Db 384 NACEMAVVWQNQLAQNKTQDLILNYINQLCDKLPSPMGESSVDCGSLASMPEISFTIGA 443 

Qy 365 ENSSRSFRITILPQLYIQPMMGAGLNYECY RFGI S P STNAL- VI GATVMEGFYVI F 419 

: : : | : | | : | | : | I I I : : I | : : : | 

Db 444 K KFALKPEEYIL-KVGEGAAAQCISGFTAMDIPPPRGPLWILGDVFMGAYHTVF 496 

Qy 42 0 DRAQKRVGFAAS 431 

I : I I I I I I 

Db 4 97 DYGKMRVGFAKS 508 



RESULT 12 
PEPC_RAT 

ID PEPC_RAT STANDARD; PRT; 392 AA. 

AC P04073; 

DT 01-NOV-198 6 (Rel. 03, Created) 

DT 01-NOV-1986 (Rel. 03, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Gastricsin precursor (EC 3.4.23.3) (Pepsinogen C) . 

GN PGC. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Wistar; 

RX MEDLINE= 8 9255508; PubMed=2 722863; 

RA Ishihara T., Ichihara Y., Hayano T., Katsura I., Sogawa K., 

RA Fuj ii-Kuriyama Y. , Takahashi K. ; 

RT "Primary structure and transcriptional regulation of rat pepsinogen C 

RT gene . " ; 

RL J. Biol. Chem. 264:10193-10199(1989). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Wistar; 

RX MEDLINE=8 7 054 020; PubMed=37 8 0741; 

RA Ichihara Y., Sogawa K., Morohashi K. , Fuj ii-Kuriyama Y., Takahashi K.; 

RT "Nucleotide sequence of a nearly full-length cDNA coding for 

RT pepsinogen of rat gastric mucosa."; 

RL Eur. J. Biochem. 161:7-12(1986). 

RN [3] 

RP SEQUENCE OF 16-112. 

RC STRAIN=Wistar; 

RX MEDLINE=84257697; PubMed-6743670; 

RA Arai K.M., Muto N . , Tani S., Akahane K. ; 

RT "The N-terminal sequence of rat pepsinogen."; 

RL Biochim. Biophys . Acta 788:256-261(1984). 

CC -!- CATALYTIC ACTIVITY: More restricted specificity than pepsin A, but 

CC shows preferential cleavage at Tyr- | -Xaa bonds; high activity 

CC towards hemoglobin as substrate. 

CC -!- SIMILARITY: Belongs to peptidase family Al . 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 



CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://vmw.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 



DR 


EMBL; 


■ M25993, 


; AAA41827. 


1; 




DR 


EMBL, 


; M25985, 


; AAA41827. 


1; 


JOINED. 


DR 


EMBL, 


M25986, 


; AAA41827. 


1; 


JOINED. 


DR 


EMBL, 


: M25987, 


; AAA41827. 


1; 


JOINED. 


DR 


EMBL, 


• M25988, 


; AAA41827. 


1; 


JOINED. 


DR 


EMBL, 


: M25989, 


r AAA41827. 


1; 


JOINED. 


DR 


EMBL, 


: M25990, 


; AAA41827. 


1; 


JOINED. 


DR 


EMBL, 


: M25991, 


? AAA41827. 


1; 


JOINED. 


DR 


EMBL, 


? M25992, 


; AAA41827. 


1; 


JOINED. 


DR 


EMBL, 


? X04644, 


? CAA28305. 


1; 


• 


DR 


PIR; 


A33510; 


A24608. 






DR 


HSSP; P20142; 1AVF. 







DR MEROPS; A01.003; 

DR InterPro; IPR001969; Aspprotease_AS . 

DR InterPro; IPR009007; Pept_A_acid. 

DR InterPro; IPR001461; Peptidase_Al . 

DR Pfam; PF00026; asp; 1. 

DR PRINTS; PR00792; PEPSIN. 

DR PROSITE; PS00141; ASP_PROTEASE; 2. 

KW Hydrolase; Aspartyl protease; Zymogen; Digestion; Signal. 



FT 


SIGNAL 


1 


16 




FT 


PROPEP 


17 


62 


ACTIVATION PEPTIDE. 


FT 


CHAIN 


63 


392 


GASTRICSIN. 


FT 


ACT SITE 


94 


94 




FT 


ACT SITE 


280 


280 




FT 


DISULFID 


107 


112 


BY SIMILARITY. 


FT 


DISULFID 


270 


275 


BY SIMILARITY. 


FT 


DISULFID 


314 


347 


BY SIMILARITY. 


FT 


CONFLICT 


31 


31 


E -> Q (IN REF. 3) . 


FT 


CONFLICT 


103 


103 


S -> A (IN REF. 3) . 


FT 


CONFLICT 


109 


109 


S -> L (IN REF. 3) . 


SQ 


SEQUENCE 


392 AA; 


42833 


MW; 092A5EAF27 83EDD1 



Query Match 11.6%; Score 313; DB 1; Length 392; 

Best Local Similarity 29.5%; Pred. No. 2.2e-15; 

Matches 105; Conservative 56; Mismatches 139; Indels 56; Gaps 16; 

92 YYLEMLIGTPPQKLQILVDTGSSNFAV AGTPHSYIDTYFDTERSSTYRSKGF 143 

I : I : I I II I I : I I I I I I I I III: I : : I I I I :: I 

76 YFGEISIGTPPQNFLVLFDTGSSNLWVSSVYCQSEACTTHA RFNPSKSSTYYTEGQ 131 



Qy 144 DVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPG IKWNGILG 198 

: : : I M I II I I : I : I I I I I II : : : I I : I 

Db 132 TFSLQYGTGSLTGFFGYDTLTV QSIQVPNQEFGLSEN — EPGTNFVYAQFDGIMG 18 4 

Qy 199 LAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGS — GTNGGSLVLGGIEP 256 

Ml | : | I : : : I : I I II I : I I I : I II : : 
Db 185 LAYPGLS — SGGATTALQGMLGE GALSQPLFGVYL GSQQGSNGGQIVFGGVDK 235 

Qy 257 S LYKGDIWYTPI KEEWYYQI EI LKLEI GGQSLNLDCREYNADKAI VDSGTTLLRLPQKVF 316 

: | | | : I : I : : I I : I I I III: I : | | | : | | : I I : 1 : 



Db 



236 NLYTGEITWVPVTQELYWQITIDDFLIGDQASGW-CSSQGC-QGIVDTGTSLLVMPAQYL 293 



Qy 317 DAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENSSRSFRITIL 376 

: : : : I : : I : : I : I I : I I = : 

Db 294 SELLQTIGAQE— GEYGEYF VSCDSVSS LPTLSFVL NGVQFPLS 335 



Qy 377 PQL Y- 1 QPMMGAGLN YECYRFGI S P STNALVI GATVMEGFYVI FDRAQKRVGFAAS 431 

| III : | : : I : : I I I I : I I I I 

Db 336 PSSYIIQEDNFCMVGLESISLTSESGQPLWILGDVFLRSYYAI FDMGNNKVGLATS 391 



RESULT 13 
PEPE_CHICK 

ID PEPE_CHICK STANDARD; PRT; 383 AA. 

AC P16476; 

DT 01-AUG-1990 (Rel. 15, Created) 

DT 01-AUG-1990 (Rel. 15, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Embryonic pepsinogen precursor (EC 3.4.23.-). 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae; Gallif ormes ; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=88227903; PubMed-3131317 ; 

RA Hayashi K. , Agata K., Mochii M. , Yasugi S., Eguchi G. , Mizuno T . ; 

RT "Molecular cloning and the nucleotide sequence of cDNA for embryonic 

RT chicken pepsinogen: phylogenetic relationship with prochymosin . " ; 

RL J. Biochem. 103:290-2 96(1988). 

CC -!- DEVELOPMENTAL STAGE: Specifically secreted during the embryonic 
CC period in the chicken proventriculus (glandular stomach) . 

CC -!- SIMILARITY: Belongs to peptidase family Al . 

CC 7 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; D00215; BAA00153.1; -. 

DR PIR; A41443; A41443. 

DR HSSP; P00794; 4 CMS . 

DR MEROPS; A01.028; 

DR InterPro; IPR001969; Aspprotease_AS . 

DR InterPro; IPR009007; Pept_A_acid. 

DR InterPro; IPR001461; Peptidase_Al . 

DR Pfam; PF00026; asp; 1. 

DR PRINTS; PR00792; PEPSIN. 

DR PROSITE; PS00141; ASP_PROTEASE; 2. 

KW Hydrolase; Aspartyl protease; Digestion; Signal; Glycoprotein. 

FT SIGNAL 1 16 POTENTIAL. 

FT CHAIN 17 383 EMBRYONIC PEPSINOGEN. 

FT ACT SITE 94 94 BY SIMILARITY. 



FT 


ACT SITE 


276 


276 


BY SIMILARITY. 






FT 


DISULFID 


107 


112 


BY SIMILARITY. 






FT 

J- J- 


DISULFID 


267 


271 


BY SIMILARITY. 






FT 


DISULFID 


310 


344 


BY SIMILARITY. 






FT 


CARBOHYD 


132 


132 


N- LINKED (GLCNAC. , 


. .) 


(POTENTIAL) 


FT 


CARBOHYD 


204 


204 


N-LINKED (GLCNAC. . 


. ■) 


(POTENTIAL) 


FT 


CARBOHYD 


309 


309 


N-LINKED (GLCNAC. . 


. .) 


(POTENTIAL) 


FT 


CARBOHYD 


350 


350 


N-LINKED (GLCNAC. . 


. -) 


(POTENTIAL) 


FT 


VARIANT 


51 


51 


T -> S. 






SQ 


SEQUENCE 


383 AA; 


41719 


MW; 1642796871611F54 


CRC64; 



Query Match 11.5%; Score 310; DB 1; Length 383; 

Best Local Similarity 26.8%; Pred. No. 3.5e-15; 

Matches 106; Conservative 63; Mismatches 136; Indels 90; Gaps 15; 

Qy 56 HA — DGLALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGS 113 

||||:||| III I I : I I I I I I : : I I I I 

Db 55 HAFPDVLTWTEPLL NTLDM EYYGTISIGTPPQDFTWFDTGS 97 

Qy 114 SNFAVAG TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGF 169 

|| | : I I : : | | | I : I I : : : : I I I I I I I I : 

Db 98 SNLWVPSVSCTSPACQSHQMFNPSQSSTYKSTGQNLSIHYGTGDMEGTVGCDTVTVASLM 157 



Qy 170 NTSFLWIATIFESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANI-PNVF 228 

: I : I : : I I I I : : I : : I I I I I I : I I : : I I : : I : : : I : I 

Db 158 DTNQLFGLST-SEPGQFFV-YVKFDGILGLGYPSLA— ADGITPVFDNMVNESLLEQNLF 213 

Qy 22 9 SMQMCGAGLPVAGSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSL 288 

| : : : I : I I I I : I : I I : I : : I : I I : : : I : 

Db 214 SVYLSREPM GSMWFGGI DES YFTGS INWI PVS YQGYWQI SMDS I IVNKQEI 2 65 



Qy 289 NLDCREYNADKAIVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNS 348 

: : I I : I : I I : I : I : : I I I 

Db 266 ACS S GCQAI I DT GT S LVAG PAS DI N D I Q S AVG ANQ 300 

Qy 34 9 ETPWSYFPKISIYLRDENSSRSFRITILPQLYIQPMMGAGLNY ECY 3 94 

II I | : | : : : I I : I I 

Db 301 NTYGEY SVNCSHILAMPDWF — VIG-GIQYPVPALAYTEQNGQGTCM 345 



Qy 395 RFGISPSTNALVIGATVMEGFYVIFDRAQKRVGFA 429 

: I : : : I : : I I I II I I I I I 

Db 346 S S FQNS SADLWI LGDVFI RVYYSI FDRANNRVGLA 380 



RESULT 14 
CAT D_HUMAN 

ID CAT D_HUMAN ST7US1DARD; PRT; 412 AA. 

AC P07339; 

DT 01-APR-1988 (Rel. 07, Created) 

DT 01-APR-1988 (Rel. 07, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Cathepsin D precursor (EC 3.4.23.5). 

GN CTSD. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
OX NCBI TaxID-9606; 



RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=85270436; PubMed-3927292 ; 

RA Faust P.L., Kornfeld S. f Chirgwin J.M.; 

RT "Cloning and sequence analysis of cDNA for human cathepsin D."; 

RL Proc. Natl. Acad. Sci . U.S.A. 82:4910-4914(1985). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=87231068; PubMed-3588310 ; 

RA Westley B.R., May F.E.B.; 

RT "Oestrogen regulates cathepsin D mRNA levels in oestrogen responsive 

RT human breast cancer cells-"; 

RL Nucleic Acids Res. 15:3773-3786(1987). 

RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=912 99158; PubMed=20697 17 ; 

RA Redecker B., Heckendorf B., Grosch H.W., Mersmann G. , Hasilik A. ; 

RT "Molecular organization of the human cathepsin D gene."; 

RL DNA Cell Biol. 10:423-431(1991). 

RN [4] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Kidney; 

RX MEDLINE-22388257; PubMed=12 477 932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L., Marusina K . , Farmer A. A. , Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A. , Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S. f Worley K.C., Hale S., Garcia A.M. f Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E. , Ketteman M. , Madan A., Rodrigues S. r Sanchez A. , 

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G. f 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J. f Myers R.M., 

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2 002). 

RN [5] 

RP SEQUENCE OF 1-22 FROM N.A. 

RX MEDLINE=94085791; PubMed=826238 6 ; 

RA May F.E., Smith D.J., Westley B.R.; 

RT "The human cathepsin D-encoding gene is transcribed from an estrogen- 

RT regulated and a constitutive start point."; 

RL Gene 134:277-282(1993). 

RN [6] 

RP SEQUENCE OF 1-22 FROM N.A. 

RX MEDLINE=95021301; PubMed=7 935485; 

RA Augereau P., Miralles F., Cavailles V. , Gaudelet C, Parker M. , 

RA Rochefort H.; 

RT "Characterization of the proximal estrogen-responsive element of 

RT human cathepsin D gene.' 1 ; 



RL Mol. Endocrinol. 8:693-703(1994). 

RN [7] 

RP SEQUENCE OF 170-180. 

RC TISSUE=Liver; 

RA Hochstrasser D.F., Frutiger S., Paquet N. , Bairoch A., Ravier F. , 

RA Pasquali C, Sanchez. J.-C, Tissot J.-D., Bjellqvist B. , Vargas R. , 

RA Appel R.D., Hughes G.J.; 

RL Submitted (JUN-1992) to Swiss-Prot. 

RN [8] 

RP CARBOHYDRATE-LINKAGE SITE ASN-2 63. 

RX MEDLINE=22660472; PubMed=12754519 ; 

RA Zhang H., Li X.-J., Martin D.B., Aebersold R. ; 

RT "Identification and quantification of N-linked glycoproteins using 

RT hydrazide chemistry, stable isotope labeling and mass spectrometry."; 

RL Nat. Biotechnol. 21:660-666(2003). 

RN [9] 

RP VARIANT VAL-58. 

RX MEDLINE=20179010; PubMed=10716266; 

RA Papassotiropoulos A., Bagli M. , Kurz A. , Kornhuber J., Forstl H., 

RA Maier W., Pauls J. , Lautenschlager N., Heun R. ; 

RT "A genetic variation of cathepsin D is a major risk factor for 

RT Alzheimer's disease."; 

RL Ann. Neurol. 47:399-4 03(2000). 

RN [10] 

RP X-RAY CRYSTALLOGRAPHY (3 ANGSTROMS) . 

RC TISSUE=Spleen; 

RX MEDLINE=9 32 2 3670; PubMed=8 4 67789; 

RA Metcalf P., Fusek M. ; 

RT "Two crystal structures for cathepsin D: the lysosomal targeting 

RT signal and active site."; 

RL EMBO J. 12:1293-1302(1993). 

RN [11] 

RP X-RAY CRYSTALLOGRAPHY (2.5 ANGSTROMS). 

RC TISSUE=Liver; 

RX MEDLINE-93342076; PubMed=8393577 ; 

RA Baldwin E.T., Bhat T.N., Gulnik S., Hosur M.V., Sowder R.C. II, 

RA Cachau R.E., Collins J., Silva A.M., Erickson J.W. ; 

RT "Crystal structures of native and inhibited forms of human cathepsin 

RT D: implications for lysosomal targeting and drug design."; 

RL Proc. Natl. Acad. Sci. U.S.A. 90:6796-6800(1993). 

CC -!- FUNCTION: Acid protease active in intracellular protein breakdown. 
CC Involved in the pathogenesis of several diseases such as breast 

CC cancer and possibly Alzheimer's disease. 

CC -!- CATALYTIC ACTIVITY: Specificity similar to, but narrower than, 
CC that of pepsin A. Does not cleave the 4-Gln- | -His-5 bond in B 

CC chain of insulin. 

CC SUBUNIT: Consists of a light chain and a heavy chain. 

CC -!- SUBCELLULAR LOCATION: Lysosomal. 

CC -!- POLYMORPHISM: The Val-58 allele is significantly overrepresented 
CC in demented patients (11.8%) compared with nondemented controls 

CC (4.9%). Carriers of the Val-58 allele have a 3.1-fold increased 

CC risk for developing AD than noncarriers. 

CC -!- SIMILARITY: Belongs to peptidase family Al . 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 
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use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 



EMBL 
EMBL 
EMBL 
EMBL 
EMBL 
EMBL 
EMBL 
EMBL 
EMBL 
EMBL 
EMBL 



M11233 
X05344 
M63138 
M63134 
M63135 
M63136 
M63137 



JOINED . 
JOINED. 
JOINED . 
JOINED. 

l; -. 



AAB59529. 1 
CAA28955. 1 
AAA51922. 1 
AAA51922.1 
AAA51922. 1 
AAA51922. 1 
AAA51922. 1 
BC016320; AAH16320 
L12980; AAA16314.1 
S74689; AAD14156.1 
S52557; AAD13868.1; 
PIR; A25771; KHHUD. 
PDB; 1LYA; 31-JAN-94 
PDB; 1LYB; 31-JAN-94 
PDB; 1LYW; 22-JUL-99 
MEROPS; A01.009; -. 
SWISS-2DPAGE; P07339 
Siena-2DPAGE; P07339; 
Genew; HGNC:2529; CTSD. 
MIM; 116840; 

GO; GO: 0004192; F:cathepsin D activity; TAS . 

InterPro; IPR001969; Aspprotease_AS . 

InterPro; IPR009007; Pept_A_acid. 

InterPro; IPR001461; Peptidase_Al . 

Pfam; PF00026; asp; 1. 

PRINTS; PR00792; PEPSIN. 

PROSITE; PS00141; ASP__PROTEASE; 2. 

Hydrolase; Aspartyl protease; Glycoprotein; Lysosome; Signal; Zymogen; 



HUMAN. 



Polymorphism 


l; Alzheimer 1 s 


disease; 3D-structure . 




SIGNAL 


1 


18 






PROPEP 


19 


64 


ACTIVATION PEPTIDE. 




CHAIN 


65 


412 


CATHEPSIN D. 




CHAIN 


65 


161 


CATHEPSIN D LIGHT CHAIN 


(PROBABLE) . 


CHAIN 


169 


412 


CATHEPSIN D HEAVY CHAIN 


(PROBABLE) . 


ACT SITE 


97 


97 






ACT_SITE 


295 


295 






DISULFID 


91 


160 






DISULFID 


110 


117 






DISULFID 


286 


290 






DISULFID 


329 


366 






CARBOHYD 


134 


134 


N-LINKED (GLCNAC. . .). 




CARBOHYD 


263 


263 


N-LINKED (GLCNAC. . .). 




VARIANT 


58 


58 


A -> V (ASSOCIATED WITH 


INCREASED RISK 








AD; POSSIBLY INFLUENCES 


SECRETION AND 








INTRACELLULAR MATURATION; dbSNP: 17571) 








/FTId=VAR_011621. 




STRAND 


67 


74 






TURN 


75 


77 






STRAND 


78 


85 






TURN 


86 


89 






STRAND 


90 


97 






TURN 


98 


99 







FT STRAND 103 107 

FT TURN 108 109 

FT TURN 112 113 

FT HELIX 115 118 

FT TURN 119 119 

FT STRAND 123 123 

FT HELIX 125 127 

FT TURN 129 130 

FT STRAND 132 141 

FT STRAND 146 158 

FT STRAND 172 184 

FT HELIX 188 192 

FT STRAND 197 200 

FT HELIX 204 206 

FT HELIX 208 210 

FT HELIX 214 220 

FT TURN 221 222 

FT STRAND 228 233 

Query Match 11.5%; Score 308.5; DB 1; Length 412; 

Best Local Similarity 27.1%; Pred. No. 5e-15; 

Matches 121; Conservative 75; Mismatches 180; Indels 71; Gaps 22; 

Qy 9 LL P LLAQWL L RAAP E LAP AP FT L P LRVAAATN RVVAP T P G PGTPAERHADGLAL 62 

I I I I I I I I I I : I I : I : : I | : : : : 

Db 6 LLPLAL — CLLAAP — ASALVRIPLHKFTSIRRTMSEVGGSVEDLIAKGPVSKYSQAVPA 61 

Qy 63 ALE P ALAS PAGAAN FLAMVDNLQGDSGRGYYLEMLI GT P PQKLQI LVDTGS SNFAVAGT P 122 

| : | | : : I I I : I I I I I I :: I I I I I I I 

Db 62 VTEGPI — PEVLKNYM DAQYYGEI GI GT P PQCFT WFDTGS SNLWVP S I H 109 

Q y 123 HSYIDT YFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIP— KGFNTSFL 174 

: | : : : : : | | | | I : I | | : I : : : I I : : I : I I 

Db HO CKLLDIACWIHHKYNSDKSSTYVKNGTSFDIHYGSGSLSGYLSQDTVSVPCQSASSASAL 169 

Qy 175 — VNIATIFESENFFLPGI KWNGILGLAYATLAKPSSSLETFFDSLVTQANI-PN 226 

| : | Ml I : : I I I I : I I : : : = : 11 = 1= I : I 

Db 17 0 GGVKVERQVFGEATKQPGITFIAAKFDGILGMAYPRIS— VNNVLPVFDNLMQQKLVDQN 227 

Qy 227 VFSMQMCGAGLPVAGSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQ 286 

: | | | llhlll: III = I : : I : I : : : : I : 

D b 228 IFSFY LSRDPDAQPGGELMLGGTDSKYYKGSLSYLNVTRKAYWQVHLDQVEV-AS 281 

Qy 287 SLNLDCREYNADKAIVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWT 346 

| | I : I : I I I I : I I : I : I I I I : I : : I 

Db 282 GLTL-CKE — GCEAIVDTGTSLMVGP VDEVRELQKAIGAVPLIQGEY MIPC— 329 

Qy 347 NSETPWSYFPKISIYLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRF GISPSTN 403 

| | | : : | : : : : : | : I : I I I 11 = 

Db 330 EKVSTLPAITLKL GGKGYKLS — PEDYTLKVSQAGKTLCLSGFMGMDIPPPSG 380 

Qy 404 AL- VT GATVMEGFYVI FDRAQKRVGFA 429 

| : : | : : I : II I I II I I 
Db 381 PLWILGDVFIGRYYTVFDRDNNRVGFA 407 



RESULT 15 



CATD_MOUSE 

ID CATD_MOUSE STANDARD; PRT; 410 AA. 

AC P18242; 

DT 01-NOV-1990 (Rel. 16, Created) 

DT 01-NOV-1990 (Rel. 16, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Cathepsin D precursor (EC 3.4.23.5). 

GN CTSD. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBIJTaxI D=l 0 090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Brain; 

RX MEDLINE=91088345; PubMed-2263503 ; 

RA Diedrich J.F., Staskus K.A. , Retzel E.F., Haase A.T.; 

RT "Nucleotide sequence of a cDNA encoding mouse cathepsin D."; 

RL Nucleic Acids Res. 18:7184-7184(1990). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=90326544; PubMed-2374732 ; 

RA Grusby M.J., Mitchell S.C., Glimcher L.H.; 

RT "Molecular cloning of mouse cathepsin D." ; 

RL Nucleic Acids Res. 18:4008-4008(1990). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; 

RX MEDLINE=94280622; PubMed=8 011168 ; 

RA Hetman M. , Perschl A., Saftig P., von Figura K., Peters C. ; 

RT "Mouse cathepsin D gene: molecular organization, characterization of 

RT the promoter, and chromosomal localization."; 

RL DNA Cell Biol. 13:419-427(1994). 

RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE-Brain, and Mammary gland; 

RX MEDLINE=22388257; PubMed=12477 932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T. , Max S.I., Wang J., Hsieh F., 

RA Diatchenko L . , Marusina K., Farmer A. A. , Rubin G.M. , Hong L . , 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A. , McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E . , Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad- Sci. U.S.A. 99:16899-16903(2002). 
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!- FUNCTION: Acid protease active in intracellular protein breakdown. 
!- CATALYTIC ACTIVITY: Specificity similar to, but narrower than, 
that of pepsin A. Does not cleave the 4-Gln- | -His-5 bond in B 
chain of insulin. 

- SUBUNIT: Consists of a light chain and a heavy chain. 

- SUBCELLULAR LOCATION: Lysosomal. 

- SIMILARITY: Belongs to peptidase family Al . 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 

EMBL; X53337; CAA37423.1 
EMBL; X52886; CAA37067.1 
EMBL; X68378; CAA48453.1 
EMBL; X68379; CAA48453.1 
EMBL; X68380; CAA48453.1 
EMBL; X68381; CAA48453.1 
EMBL; X68382; CAA48453.1 
EMBL; X68383; CAA48453.1 
EMBL; BC054758; AAH54758 
EMBL; BC057931; AAH57931 
PIR; 148278; KHMSD. 
HSSP; P07339; 1LYB. 
MEROPS; A01.009; -. 
MGD; MGI: 88562; Ctsd. 

InterPro; IPR001969; Aspprotease_AS . 
InterPro; IPR009007; Pept_A_acid. 
InterPro; IPR001461; Peptidase_Al . 
Pfam; PF00026; asp; 1. 
PRINTS; PR00792; PEPSIN. 
PROSITE; PS00141; ASP_PROTEASE; 2. 

Hydrolase; Aspartyl protease; Glycoprotein; Lysosome; Zymogen; Signal. 



JOINED. 
JOINED. 
JOINED. 
JOINED. 
JOINED. 

1; -. 
l; 



FT 


SIGNAL 


1 


20 




POTENTIAL. 




FT 


PROPEP 


21 


64 




ACTIVATION PEPTIDE 


(POTENTIAL) . 


FT 


CHAIN 


65 


410 




CATHEPSIN D. 




FT 


ACT_SITE 


97 


97 




BY SIMILARITY. 




FT 


ACT SITE 


293 


293 




BY SIMILARITY. 




FT 


DISULFID 


91 


160 




BY SIMILARITY. 




FT 


DISULFID 


110 


117 




BY SIMILARITY. 




FT 


DISULFID 


284 


288 




BY SIMILARITY. 




FT 


DISULFID 


327 


364 




BY SIMILARITY. 


. ) (BY SIMILARITY) 


FT 


CARBOHYD 


134 


134 




N-LINKED (GLCNAC. . 


FT 


CARBOHYD 


261 


261 




N-LINKED (GLCNAC. . 


. ) (BY SIMILARITY) 


SQ 


SEQUENCE 


410 AA; 


44954 MW; 


DC4928EC46928BF0 


CRC64; 


Query Match 




11. 


4%; 


Score 306.5; DB 1; 


Length 410; 




Best Local 


Similarity 


27. 


5%; 


Pred. No. 6.9e-15; 





Matches 103; Conservative 64; Mismatches 123; Indels 85; Gaps 15; 



Qy 

Db 



92 YYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDT YFDTERSSTYRSKGFDV 145 

| | :: | | | I M : : I M I I I I : I : : : : : I I II I 

79 YYGDIGIGTPPQCFTWFDTGSSNLWVPSIHCKILDIACWVHHKYNSDKSSTYVKNGTSF 138 



Qy 146 TVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIAT IFESENFFLPGI KWNGIL 197 

: I I I : I : : : I I : : I : I I II I III I : : I I I 

Db 139 DIHYGSGSLSGYLSQDTVSVPCKSDQSKARGIKVEKQIF-GEATKQPGIVFVAAKFDGIL 197 

Qy 198 GLAYATLAKP S S S LET FFDS LVTQANI - PNVFSMQMCGAGLPVAGS GTNGGS LVLGGI EP 256 

I : I : : : : : I I : I : I : I : I I I I II 1 = 111 = 

Db 198 GMGYPHIS — VNNVLPVFDNLMQQKLVDKNI FS FY LNRDPEGQPGGELMLGGTDS 250 

Qy 257 SLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVF 316 

I I :: I : : I : I : : : I I : I : I I I : : I I I I : I I : I I I : 

Db 251 KYYHGELSYLNVTRKAYWQVHMDQLEVGNE-LTL-CK — GGCEAIVDTGTSLLVGPVEEV 306 

Qy 317 DAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENSSRSFRITIL 376 

: : I : I I : I : : : I 

Db 307 KELQKAIGAVPLI QGEYMIPCEKVSSL 333 

Qy 377 PQLYIQPMMGAGLNYEC YRFGIS P S TNALVI GAT VMEG 414 

I : I : : : I I I I I I : I II : : I : 

Db 334 PTVYLK — LG-GKNYELHPDKYILKVSQGGKTICLSGFMGMDIPPPSGPLWILGDVFIGS 390 

Qy 415 FYVT FDRAQKRVGFA 429 

: I : I I I Mill 
Db 391 YYTVFDRDNNRVGFA 4 05 



Search completed: February 28, 2004, 09:09:47 
Job time : 13.7727 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



February 28, 2004, 09:05:15 ; Search time 41.4848 Seconds 

(without alignments) 
3939.714 Million cell updates/sec 



Title: US-09-668-314C-2 
Perfect score: 2687 

Sequence: 1 MGALARALLLPLLAQWLLRA. 



RPRDPEWNDESSLVRHRWK 518 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 



1017041 seqs, 315518202 residues 



Total number of hits satisfying chosen parameters 



1017041 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : 



SPTREMBL 25:* 



1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 



sp_archea : * 
sp_bacteria : * 
sp_f ungi : * 
sp_human : * 
sp_invertebrate : * 
sp_mammal : * 
sp mhc : * 
sp_organelle : * 
sp_phage : * 

sp_plant : * 

sp_rodent : * 

sp_virus : * 

sp_vertebrate : * 

sp^unclassif ied: * 

sp_rvirus : * 

sp_bacteriap: * 

sp archeap : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 
No. 



Score 



% 

Query 

Match Length DB 



ID 



Description 



1 


2405 


89. 


5 


514 


11 


Q8C5E9 


Q8c5e9 mus musculu 


2 


2399 


89. 


3 


514 


11 


Q8C793 


Q8c793 mus musculu 


3 


2395 


89. 


1 


514 


11 


Q9JL18 


Q9jll8 mus musculu 


4 


2375 


88 . 


4 


468 


4 


Q9NZL2 


Q9nzl2 homo sapien 


5 


2293 


85. 


3 


439 


4 


Q9H2V8 


Q9h2v8 homo sapien 


6 


2184 


81. 


3 


423 


4 


Q8N2D4 


Q8n2d4 homo sapien 


7 


1966 


73. 


2 


396 


4 


Q9NZL1 


Q9nzll homo sapien 


8 


1693 


63. 


0 


500 


13 


Q7T0Y2 


Q7t0y2 xenopus lae 




1246 


46 . 


4 


255 


11 


Q9R1P7 

XL. 


Q9rlp7 mus musculu 


10 

JL V 


1186 

JL -J— w X-/ 


44 . 


1 


501 


11 


Q8C7R1 


Q8c7rl mus musculu 


11 


1183 . 5 


44 . 


0 


532 


4 


Q9ULS1 


Q9ulsl homo sapien 


1 7 


1183 

J_ _L W *j 


44 . 


o 


501 


11 


Q8BQY4 


Q8bqy4 mus musculu 


1 3 


1172 5 


43 


6 


501 


4 


Q8IYC8 


Q8iyc8 homo sapien 


1 4 


i ri4 9 


39 


o 


467 


11 


Q8C4F4 


Q8c4f4 mus musculu 


1 s 

X J 


6S3 

vJ >J -J 


24 


3 


267 


11 


Q9CUU5 


Q9cuu5 mus musculu 


-L \J 


461 


17 . 


2 


213 


4 


Q9P0D2 


Q9p0d2 homo sapien 


1 7 


386 

\j \j 


14 . 


4 


244 


5 


Q8WQY9 


Q8wqy9 aphrocallis 


1 R 


367 5 


13 


7 


383 


13 


Q9DEC3 


Q9dec3 xenopus lae 


1 Q 


361 5 


13 . 


5 


389 


6 


Q9GMY4 


Q9qmy4 sorex ungui 




3 S S S 
j j j i j 


1 3 


2 


384 


13 


Q91322 


Q91322 rana catesb 


91 

eL X 


355 
-j 


13 . 


2 


389 


13 


Q9W643 


Q9w643 gallus gall 


9 9 

z. z. 


35 5 


13 . 

jl ^ » 


2 


389 


13 


Q9PWK1 


Q9pwkl gallus gall 


93 


351 5 


13 . 

-L • 


1 


389 


6 


Q9GMY3 


Q9gmy3 rhinolophus 




351 5 


13 . 

JL * 


1 

JL 


389 


6 


Q9GMY5 


Q9gmy5 suncus muri 


z. j 


335 5 


12 . 


5 


388 


6 


Q9GMY2 


Q9gmy2 oryctolagus 


96 


334 5 


12 


4 


391 


5 


Q9VKP6 


Q9vkp6 drosophila 


97 


396 


1 2 


1 

JL 


399 


13 


093458 


093458 podarcis si 


28 


324 5 


12 . 


1 


372 


5 


Q9VLK3 


Q9vlk3 drosophila 


9 Q 


39 4 5 


12 


1 

< JL 


383 


13 


Q9DE45 


Q9de45 salvelinus 


30 


392 5 


12 


o 


390 


6 


Q8SQ41 


Q8sq41 canis famil 


O X 


322 5 


12 


o 


397 


13 


Q9W6D4 


Q9w6d4 hynobius le 


39 


39 0 


11 


9 


385 


13 


Q9DEC4 


Q9dec4 rana catesb 


33 


319 5 


11 . 


, 9 


387 


13 


Q9DDV5 


Q9ddv5 salvelinus 


^4 


31 9 

O X J 


1 1 


9 


419 


5 


Q95VA2 


Q95va2 clonorchis 


o O 


O X O ■ J 


1 1 
X X < 


9 


396 


13 


093428 


093428 chionodraco 


36 


318 


11 ■ 


. 8 


397 


13 


Q800A0 


Q800a0 rana catesb 


37 


316 


11 . 

JL < 


. 8 


378 


13 


Q9PUR9 


Q9pur9 pseudopleur 




316 


11 , 


. 8 


392 


11 


Q9D7R7 


Q9d7r7 mus musculu 


39 


313 


11. 


.6 


383 


5 


076856 


076856 dictyosteli 


40 


312.5 


11, 


.6 


354 


5 


Q9GYX7 


Q9gyx7 boophilus m 


41 


305 


11. 


.4 


384 


13 


Q9DEC2 


Q9dec2 xenopus lae 


42 


302 


11, 


.2 


398 


13 


P87370 


P8737 0 oncorhynchu 


43 


301. 5 


11 


.2 


401 


11 


Q91X66 


Q91x66 mus musculu 


44 


300.5 


11 


.2 


386 


6 


Q9BGU5 


Q9bgu5 bos taurus 


45 


295. 5 


11 


. 0 


390 


6 


Q9GK10 


Q9gkl0 camelus dro 



ALIGNMENTS 



RESULT 1 
Q8C5E9 

ID Q8C5E9 PRELIMINARY 
AC Q8C5E9; 

DT 01-MAR-2003 (TrEMBLrel. 
DT 01-MAR-2003 (TrEMBLrel. 
DT 01-OCT-2003 (TrEMBLrel. 



PRT; 514 AA. 
23, Created) 

23, Last sequence update) 
25, Last annotation update) 



DE Beta-site APP-cleaving enzyme 2. 

GN BACE2 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Testis ; 

RX MEDLINE=22354683; PubMed=124 66851; 

RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,770 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 

DR EMBL; AK078770; BAC37384.1; -. 

DR MGD; MGI : 1860440; Bace2 . 

DR GO; GO:0004194; Frpepsin A activity; IEA. 

DR GO; GO:0006508; P : proteolysis and peptidolysis ; IEA. 

DR InterPro; IPR001969; Aspprotease_AS . 

DR InterPro; IPR001461; Peptidase_Al . 

DR InterPro; IPR009007; Pept_A_acid. 

DR Pfam; PF00026; asp; 1. 

DR PRINTS ; PR007 92 ; PEPSIN . 

DR PROSITE; PS00141; ASP_PROTEASE; 2. 

SQ SEQUENCE 514 AA; 55811 MW; CBB9237BB68A0B2E CRC64; 

Query Match 89.5%; Score 2405; DB 11; Length 514; 

Best Local Similarity 88.8%; Pred. No. 3.5e-179; 

Matches 460; Conservative 20; Mismatches 34; Indels 4; Gaps 1; 

Qy 1 MGALAR7^LLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

I I I I I I I I I I : I I I I I I I I I I I I I I I I I I : I I III : Mill I I I I 

Db 1 MGALLRALLLPVLAQWLLSAVPALAPAPFTLPLQVARATNHRASAVPGLGTPGLPRADGL 60 

Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

| | | | M I : I I I I I I I I I I I I M I I I I I I I I I I I I I I I I : I I I I I M M I I I M I 

D b 61 ALALEPVRAT AN FLAMVDN LQGD S GRG Y YL EML I GT P P Q KVQ I LVDT G S S N FAVAG 116 

q y 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

| | | | | | | M I : I I I I I I I II I I I I I I I II I I I M I I I I I I I M I I I I : II I I I I I I I 

D b 117 APHSYIDTYFDSESSSTYHSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNSSFLVNIATI 176 

Q y 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

| | M | | | I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I M I I :: I I I I I I I I I I I I I 

Db 177 FE S ENFFLPGI KWNGI LGLAYAALAK PSSSLETFFDS LVAQAKI P D I FSMQMCGAGL PVA 236 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

| | | | | | || I I I || I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 1:1 I M I I I I I I I I I 

Db 237 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQNLNLDCREYNADKA 2 96 

Qy . 301 IVDSGTTLLRLPQKVFDAWEAV7VRASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

| | | | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M : I I I M I I I I I II : I I I I I I I 

Db 297 IVDSGTTLLRLPQKVFDAWEAVARTSLIPEFSDGFWTGAQLACWTNSETPWAYFPKISI 356 

Qy 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

M I II I : I I I II I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I : I I 



Db 357 YLRDENASRS FRI T I LPQLYI QPMMGAGFN YEC YRFGI S S STNALVI GATVMEGFYWFD 416 



Qy 421 RAQKRVGFAAS P CAE I AGAAVS E I S GP FST EDVASN CVP AQ S L S E P I LWI VS YALMSVCG 480 

MhllMI I I I I I I I I I I I I I I I I I I I : I I I I I I I I : I : I I I I I I I I I I I I > I I I 

Db 417 RAQ RRVG FAVS P CAE IEGTTVSEISGPFSTEDIASN C VP AQ ALN E P I LW I VS YALM S VC G 47 6 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

1111111:11111 I : I I I I I I I I II I I I I I I I I I 

Db 477 AILLVLILLLLLPLHCRHAPRDPEWNDESSLVRHRWK 514 



RESULT 2 
Q8C793 

ID Q8C793 PRELIMINARY; PRT; 514 AA. 

AC Q8C793; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Beta-site APP-cleaving enzyme 2. 

GN BACE2 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Heart ; 

RX MEDLINE=22354683; PubMed=124 66851 ; 

RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60, 77 0 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 

DR EMBL; AK052309; BAC34931.1; -- 

DR MGD; MGI: 1860440; Bace2 . 

DR GO; GO: 0004194; F:pepsin A activity; IEA. 

DR GO; GO: 0006508; P : proteolysis and peptidolysis ; IEA. 

DR InterPro; IPR001969; Aspprotease_AS . 

DR InterPro; IPR001461; Peptidase_Al . 

DR InterPro; IPR009007; Pept_A_acid. 

DR Pfam; PF00026; asp; 1. 

DR PRINTS; PR00792; PEPSIN. 

DR PROSITE; PS00141; ASP_PROTEASE; 2. 

SQ SEQUENCE 514 AA; 55871 MW; 8BF45E07B0990225 CRC64; 

Query Match 89.3%; Score 2399; DB 11; Length 514; 

Best Local Similarity 88.6%; Pred. No. le-178; 

Matches 459; Conservative 20; Mismatches 35; Indels 4; Gaps 1; 

Qy 1 MGAIiARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRVVAPTPGP 60 

I I I I I I I I I I : I I I I I I I I I I I I I I I I I I : I I Ml : I I Ml MM 

Db 1 MGALLRALLLPVTAQWLLSAVPALAPAPFTLPLQVARATNHRASAVPGLGTPELPRADGL 60 

Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

| | | | M I : I I I I I II II M I I I I I M I II I I I I M M I : II I I I I M I II I M I 

Db 61 ALALEPVRAT ANFLAMVDNLQGDSGRGYYLEMLIGTPPQKVQILVDTGSSNFAVAG 116 



Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 18 0 

I I I I I I I I I 1 : I I I M I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I II : I I I I I I I I I 

Db 117 APHSYIDTYFDSESSSTYHSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNSSFLVNIATI 17 6 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I M II I I : : I I I II I I I I I I I I 

Db 177 FESENFFLPGI KWNG I LGLAYAALAK PSSSLETFFDS LVAQAK I P D I F SMQMC GAGL P VA 236 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I M I I I I I I II I I I I : I I I I I I I I I I I I I 

Db 237 GSGTNGGSLVLGGI EPSLYKGDIWYTPI KEEWYYQI EI LKLEI GGQNLNLDCREYNADKA 296 

Qy 301 IVDSGTTLLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

M I I i I I I I II I I I II I I II I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I : I II I I I I 

Db 297 IVDSGTTLLRLPQKVFDAWEAVARTSLIPEFSDGFWTGAQLACWTNSETPWAYFPKISI 356 

Qy 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

I I M I I : I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I : I I 

Db 357 YLRDENASRSFRTTILPQLYIQPMMGAGFNYECYRFGISSSTNALVIGATVMEGFYWFD 416 

Qy 421 RAQ K RVG FAAS P CAE I AGAAVS EISGPFSTE D VAS N C V P AQ SLSEPILWIVS YALM S VC G 480 

111:11111 I I I I I I I II I I I II I I I I I : I I I I I I I I : I : I I I I I I I II I I I I I I I 

Db 417 RAQ R RVG FAVS P CAE IEGTTVSEISGPFSTEDIASN C VP AQ ALN E P I LW I VS YALM SVC G 476 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I II I II I : I I I M I : I I I I I I I I I I I I I I II I M 
Db 477 AILLVLILLLLLPLHCRHAPRDPEWNDESSLVRHRWK 514 



RESULT 3 
Q9JL18 

ID Q9JL18 PRELIMINARY; PRT; 514 AA. 

AC Q9JL18; 

DT 01-OCT-2000 (TrEMBLrel. 15, Created) 

DT 01-OCT-2000 (TrEMBLrel . 15, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Aspartyl protease 1. 

GN BACE2 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Choi D.K., Sugano S., Sakaki Y. ; 

RT "Molecular characterization of the mouse Aspl gene, a homolog of the 

RT human ASP1 (Down Syndrome Region aspartyl protease)."; 

RL Submitted (DEC-1999) to the EMBL/ GenBank/DDBJ databases. 

CC -!- SIMILARITY: BELONGS TO PEPTIDASE FAMILY Al . 

DR EMBL; AF216310; AAF36599.1; 

DR HSSP; P00797; 2 REN . 

DR MEROPS; A01.041; -. 

DR MGD; MGI: 1860440; Bace2 . 

DR GO; GO: 0004194; F:pepsin A activity; IEA. 

DR GO; GO: 0008233; F:peptidase activity; IEA. 

DR GO; GO:0006508; P :proteolysis and peptidolysis ; IEA. 

DR InterPro; IPR001969; Aspprotease AS. 



DR InterPro; IPR001461; Peptidase_Al . 

DR InterPro; IPR009007; Pept_A_acid. 

DR Pfam; PF00026; asp; 1. 

DR PRINTS; PR00792; PEPSIN. 

DR PROSITE; PS00141; ASP_PROTEASE; 2. 

KW Aspartyl protease; Hydrolase; Protease. 

SQ SEQUENCE 514 AA; 55799 MW; A70725F2C1DF5B47 CRC64; 

Query Match 89.1%; Score 2395; DB 11; Length 514; 

Best Local Similarity 88.6%; Pred. No. 2.1e-178; 

Matches 459; Conservative 20; Mismatches 35; Indels 4; Gaps 1; 

Qy 1 MGALARALLLPLLAQWLLRAAPEIAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

I I I I II I I I : I I I I I I I I II I I I I I I I I : I I I I I : II III I I I I 

Db 1 MGALLRALLLLVIAQWLLSAVPAIAPAPFTLPLQVAGATNHRASAVPGLGTPELPRADGL 60 

Qy 61 ALALEPALASPAGAANFI^WDNLQGDSGRGYYLEMLIGTPPQKLQILVI)TGSSNFAVAG 120 

MINI I : I I I I I I I I II I I II II I I I II I II I I I I II : I I II I I I I I I I I I I I 

Db 61 ALALEPVRAT AN FLAMVDN LQ G D S GRG Y YLEML I GT P PQ KVQ I L VDTG S S N FAVAG 116 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

1111111111:1 MM II II II II II I I II I I II II II II I I M I I I M I II II II I 

Db 117 APHSYIDTYFDSESSSTYHSKGFDVTVKYTQGSWTGFVGEDLWIPKGFNSSFLVNIATI 176 

Qy 181 FESENFFLPGI KWN G I LGLAYAT LAK P S S S L ET FFD S LVTQAN I PNVF SMQMC GAGL P VA 240 

I I II II I M I II I I I II I I I II M M M I I M II I M I II M : : I I I M I I II II M 

Db 177 FESENFFLPGIKWNGILGLAYAALAKPSSSLETFFDSLVAQAKIPDIFSMQMCGAGLPVA 236 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I II I II I II II I II I I II I II II II II II I II I I M I I I M I II I : II I I I I II I I II I 
Db 237 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQNLNLDCREYNADKA 296 

Qy 301 IVDSGTTLLRLPQKYFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPW'SYFPKISI 360 

I I II I II II I II I II II II I I II II II II I I I I I II I I : I I I II I I I II M : II II II I 
Db 297 IVDSGTTLLRLPQKVFDAWEAVARTSLIPEFSDGFWTGAQLACWTNSETPWAYFPKISI 356 

Qy 361 YLRDEN S S RS FRI T I LPQLYI QPMMGAGLNYEC YRFGI S P STNALVI GATVMEGFYVI FD 420 

I I I M I : II II II II II II I I I M I II I I II M II II I II II II II I M II M II : II 

Db 357 YLRDENASRS FRI T I LPQLYI QPMMGAGFNYECYRFGI S S STNALVI GATVMEGFYWFD 416 

Qy 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFST EDVAS N CVP AQ SLSEPILWIVS YALMS VC G 480 

II I : II II I MUM I I I I II I II I I II M I I I I I I I M : \ II I I I II I II II I II 

Db 417 RAQ RRVG FAVS P CAE I E GT TVS EISGPFST ED I AS N CVP AQALN E P I LW I VS YALMS VC G 476 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I I I I I I : I I I I I I : I II II I I I I II II I I II II 

Db 477 AILLVLILLLLLPLHCRHAPRDPEWNDESSLVRHRWK 514 



RESULT 4 
Q9NZL2 

ID Q9NZL2 PRELIMINARY; PRT; 468 AA. 

AC Q9NZL2; 

DT 01-OCT-2000 (TrEMBLrel. 15, Created) 
DT 01-OCT-2000 (TrEMBLrel. 15, Last sequence update) 
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 
DE Aspartyl protease. 



GN BACE2 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE-20422477; PubMed=10965118 ; 

RA Solans A. , Estivill X. , de La Luna S-; 

RT "A new aspartyl protease on 21q22.3, BACE2, is highly similar to 

RT Alzheimer 1 s amyloid precursor protein beta-secretase . " ; 

RL Cytogenet. Cell Genet. 89:177-184(2000). 

CC -!- SIMILARITY: BELONGS TO PEPTIDASE FAMILY Al . 

DR EMBL; AF188276; AAF35835.1; -. 

DR HSSP; P00797; 2 REN . 

DR GO; GO:0004194; F:pepsin A activity; IEA. 

DR GO; GO: 0008233; F: peptidase activity; IEA. 

DR GO; GO:0006508; P : proteolysis and peptidolysis; IEA. 

DR InterPro; IPR001969; Aspprotease_AS . 

DR InterPro; IPR001461; Peptidase_Al . 

DR InterPro; IPR009007; Pept_A_acid. 

DR Pfam; PF00026; asp; 1. 

DR PRINTS; PR00792; PEPSIN. 

DR PROSITE; PS00141; ASP_PROTEASE; 2. 

KW Aspartyl protease; Hydrolase; Protease. 

SQ SEQUENCE 468 AA; 50324 MW; 717E0920126A0142 CRC64; 

Query Match 88.4%; Score 2375; DB 4; Length 468; 

Best Local Similarity 90.3%; Pred. No. 6.6e-177; 

Matches 468; Conservative 0; Mismatches 0; Indels 50; Gaps 1 

Qy 1 MGALARALLLPLIAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

i I I I I I I I I I I I I I ! I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MGALARALLL P L LAQW L LRAAP ELAP AP FT L P LRVAAATN RWAP T P G P GT P AERHAD GL 60 

Qy 61 ALALEPALAS PAGAANFLAMVI)NLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 12 0 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I M I I I II I I 1 I I I II I I I I M I I I I I I 

Db 61 ALALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 18 0 

I I I I I I II I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I 

Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATIAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

II M M I I M M I M I I I I M I I M M M I I I I I I I II I I M M II M I I M I I I I II M 

Db 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I I II I I I I I I I I I I M I I I I I I I I I I I I I I I II I II I I II I I I I I I I I I I I I I I I 
Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I I I II I I I I I I II I II I I I II I I II II 

Db 301 I VD SGTTLLRLPQ KVF D AWEAVARAS L 328 



Qy 



361 YLRDENSSRSFRITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFD 420 

I I I I I II I I M I I I I I I I I I I I M I I I II II I I II M I I I I I 



Db 



329 



LYIQPMMGAGLN YECYRFGI S PSTNALVI GATVMEGFYVI FD 37 0 



Qy 421 RAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE D VASN C VP AQ S L S E P I LW I VS YALMS VC G 480 

I I I I I I I I I II I I II I I I I I II I I I I I I | I I | I I I I I I I I I I I I I I I I M I M I I I I I II 

Db 371 RAQKRVG FAAS PCAEI AGAAVS E I S GP FS T EDVASNC VPAQS LS E P I LWI VS YALMS VCG 430 

Qy 481 AILLVLIVLLLLPFRCQRRPRDPEWNDESSLVRHRWK 518 

I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 

Db 431 AILLVLIVLLLLPFRCQRRPRDPEVVNDESSLVRHRWK 468 



RESULT 5 
Q9H2V8 

ID Q9H2V8 PRELIMINARY; PRT; 439 AA. 

AC Q9H2V8; 

DT 01-MAR-2001 (TrEMBLrel. 16, Created) 

DT 01-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE CDA13. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Pheochromocytorna; 

RA Li Y., Huang Q., Peng, y, Song H., Yu Y., Xu S., Ren S., Chen Z., 

RA Han Z . ; 

RL Submitted (DEC-1999) to the EMBL/ GenBank/DDBJ databases. 

CC -!- SIMILARITY: BELONGS TO PEPTIDASE FAMILY Al . 

DR EMBL; AF212252; AAG41783.1; -. 

DR HSSP; P00797; 2 REN . 

DR GO; GO: 0004194; F:pepsin A activity; IEA. 

DR GO; GO: 0008233; F:peptidase activity; IEA. 

DR GO; GO: 0006508; P : proteolysis and peptidolysis ; IEA. 

DR InterPro; IPR001969; Aspprotease__AS . 

DR InterPro; IPR001461; Peptidase_Al . 

DR InterPro; IPR009007; Pept_A_acid. 

DR Pfam; PF00026; asp; 1. 

DR PRINTS; PR00792; PEPSIN. 

DR PROSITE; PS00141; ASP_PROTEASE; 2. 

KW Aspartyl protease; Hydrolase; Protease. 

SQ SEQUENCE 439 AA; 48275 MW; 02ECOEOE50F11602 CRC64; 

Query Match 85.3%; Score 2293; DB 4; Length 439; 

Best Local Similarity 100.0%; Pred. No. 1.5e-170; 

Matches 439; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 8 0 MVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDTYFDTERSSTYR 139 

I I I II I II I I I II I I I I I I I II I I I I I III M II I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDTYFDTERSSTYR 60 

Qy 140 SKGFDVTVT<YTQGSWTGEVGEDLVTIPKGFNTSFLVNIATIFESENFFLPGIKWNGILGL 199 

I I I I I I I I II I I I I I I I II II I I II I I I I I II I I I I I I I I I I I I I I I I I I II I I I II I I I 

Db 61 SKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPGIKWNGILGL 120 



Qy 200 AYAT LAKP S S S LET FFDS LVTQAN I PNVFSMQMCGAGL PVAGS GTN GGS LVLGGI EPS LY 259 



M II I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M II 

Db 121 AYATLAKPSSSLETFFDSLOTQANIPNVFSMQMCGAGLPVAGSGTNGGSLVLGGIEPSLY 180 

Qy 260 KGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFDAV 319 

I M M M I M II II M M M I I I M M II I M M M M II M I I M M M I I II I II I II 

Db 181 KGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFDAV 240 

Qy 320 VEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENSSRSFRITILPQL 379 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I II I I I I I I I II I I I I I I I I I M I 

Db 241 VEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENSSRSFRITILPQL 300 

Qy 380 YI Q PMMGAGLN YECYRFGISPS TNAL VI GAT VMEG F YVT FD RAQKRVG FAAS P CAE I AGA 439 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I II I I II I I I I I I 

Db 301 YIQPMMGAGLNYECYRFGI S P STNALVI GATVMEGFYVI FDRAQKRVGFAAS PCAEI AGA 360 

Qy 440 AVS E I S G P F S T EDVASNCVPAQ SLSEPILWI VS YALMS VCGAI LLVL I VL LLL P FRCQ RR 499 

II I M M I I II I I II I I I II I II I I I II II I II I I M I I I II I I I I II I I I I II I I I I I I 

Db 361 AVS EISGPFSTE D VAS N C VP AQ SLSEPILWIVS YALM S VC GAI L L VL I VL LLLPFRCQRR 420 

Qy 500 PRDPEWNDESSLVRHRWK 518 

I I I I I I I I I I II I I I I I I I 

Db 421 PRDPEWNDESSLVRHRWK 439 



RESULT 6 
Q8N2D4 

ID Q8N2D4 PRELIMINARY; PRT; 423 AA. 

AC Q8N2D4; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein OVARC1000363 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE^Ovarian carcinoma; 

RA Ota T . , Nishikawa T., Suzuki Y. , Kawai-Hio Y. , Hayashi K. , Ishii S., 

RA Saito K., Yamamoto J., Wakamatsu A., Nagai T., Nakamura Y., 

RA Nagahari K., Sugano S., Isogai T.; 

RT "HRI human cDNA sequencing project."; 

RL Submitted (MAR-2002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AK075539; BAC11682.1; -. 

DR GO; GO: 0004194; F:pepsin A activity; IEA. 

DR GO; GO:0006508; P : proteolysis and peptidolysis; IEA. 

DR InterPro; IPR001969; Aspprotease_AS . 

DR InterPro; IPR001461; Peptidase_Al . 

DR InterPro; IPR009007; Pept_A_acid. 

DR Pfam; PF00026; asp; 1. 

DR PRINTS; PR00792; PEPSIN. 

DR PROSITE; PS00141; ASP_PROTEASE; 2. 

KW Hypothetical protein. 

SQ SEQUENCE 423 AA; 46457 MW; 4D4839F2ED9C2CE1 CRC64; 



Query Match 



81.3%; Score 2184; DB 4; Length 423; 



Best Local Similarity 99.3%; Pred. No. 4.7e-162; 

Matches 420; Conservative 1; Mismatches 2; Indels 0; Gaps 



Qy 96 MLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWT 155 

| | | I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I M I I M I I I I I I I I I I I II 1 I I I I I 

Db 1 MLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDTYFDTERSSTYRSKGFDWVKYTQGSWT 60 

Qy 156 GFVGEDLVTIPKGFNTSFLVNIATIFESENFFLPGIKWNGILGLAYATLAKPSSSLETFF 215 

I | | M | | I I I I I I I I I I I II I I II I I I I : I I I I I I I I I I I I I I I I I I I I I M 

Db 61 GFVGEDLVTIPKGFNTSFLWIATIFESGNFFLPGIQWNGILGLAYATLAKPSSSLETFF 120 

Qy 216 DS LVTQAN I PNVFSMQMCGAGL PVAGS GTNGGS LVLGGI EP S LYKGDI WYT P I KEEWYYQ 275 

| | I I M I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I M I I 
Db 121 DSLVTQANIPNVFSMQMRGAGLPVAGSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQ 180 

Qy 276 IEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDG 335 

M | | | | | | I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I M I I I I I I I I I I I I I I I I M 

D b 181 IEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDG 240 

Qy 336 FWTGSQLACWTNSETPWSYFPKISIYLRDENSSRSFRITILPQLYIQPMMGAGLNYECYR 395 

| | I I I M I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 

Db 241 FWTGSQLACWTNSETPWSYFPKISIYLRDENSSRSFRITILPQLYIQPMMGAGLNYECYR 300 

Q y 396 FGI SPSTNALVIGATVMEGFYVI FDRAQKRVGFAASPCAEIAGAAVSEI SGPFSTEDVAS 455 

I | | | | | | | || | I I I I I I I M I I I I I I I I I M I II I I I I I I I I I I I I I I M I I I I I I I I I I 

Db 301 FG I S P S TNALVI GAT VME G F YVI FDRAQ KRVG FAAS P C AE I AGAAVS E I S G P F S T EDVAS 360 

Qy 456 NCVPAQSLSEPILWIVSYALMSVCGAILLVLIVLLLLPFRCQRRPRDPEVWDESSLVRH 515 

| | | | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I 
Db 361 NCVPAQSLSEPILWIVSYALMSVCGAILLVLIVLLLLPFRCQRRPRDPEVWDESSLVRH 420 

Qy 516 RWK 518 

I I I 

Db 421 RWK 423 



RESULT 7 
Q9NZL1 

ID Q9NZL1 PRELIMINARY; PRT; 396 AA. 

AC Q9NZL1 ; 

DT 01-OCT-2000 (TrEMBLrel. 15, Created) 

DT 01-OCT-2000 (TrEMBLrel. 15, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Aspartyl protease. 

GN BACE2 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBIJTaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE-20422477; PubMed-10965118 ; 

RA Solans A., Estivill X., de La Luna S.; 

RT "A new aspartyl protease on 21q22.3, BACE2, is highly similar to 

RT Alzheimer's amyloid precursor protein beta-secretase . " ; 

RL Cytogenet. Cell Genet. 89:177-184(2000). 

CC -!- SIMILARITY: BELONGS TO PEPTIDASE FAMILY Al . 



DR EMBL ; AF188277; AAF35836.1; 

DR HSSP; P00797; 2 REN. 

DR GO; GO: 0004194; F;pepsin A activity; IEA. 

DR GO; GO: 0008233; Ftpeptidase activity; IEA. 

DR GO; GO: 0006508; P .-proteolysis and peptidolysis ; IEA. 

DR InterPro; IPR001969; Aspprotease_AS . 

DR InterPro; IPR001461; Peptidase_Al . 

DR InterPro; IPR009007; Pept_A_acid. 

DR Pfam; PF00026; asp; 1. 

DR PRINTS; PR00792; PEPSIN. 

DR PROSITE; PS00141; ASP_PROTEASE; 2. 

KW Aspartyl protease; Hydrolase; Protease. 

SQ SEQUENCE 396 AA; 43013 MW; 5023A7AF391CEAC9 CRC64; 

Query Match 73.2%; Score 1966; DB 4; Length 396; 

Best Local Similarity 100,0%; Pred. No. 4.5e-145; 

Matches 37 8; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MGALARALLL P LLAQWLLRAAP ELAP AP FT L P LRVAAATNRWAPT P GP GT PAERHADGL 60 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MGALARALLLPLLAQWLLRAAPELAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGL 60 

Qy 61 ALALEPALAS PAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAG 120 

I I I I II I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 61 ALALEPALAS PAGAAN FLAMVDNLQGDS GRGYYLEMLI GT P PQKLQI LVDTGS SNFAVAG 120 

Qy 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

II II II I I I I I II I I I I I I I I I I II I I II I I II II I I II I II I I I I I I II I I I I I I II I I 

Db 121 TPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATI 180 

Qy 181 FESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

I I I I I I I I I I I I I I II I I I II I I I I I I I I I I II II I I I I I II I I I I I I I I I I I II I I I I I 
Db 181 FESENFFLPGIKWNGILGLAYATIAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVA 240 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I I II I II I II II II I II II I M I I I M I M M I I I I M I II I II I M M I I I II I M I II 

Db 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

Qy 361 YLRDENSSRSFRITILPQ 378 

II I I II I I M II M I I I I 

Db 361 YLRDENSSRSFRITILPQ 378 



RESULT 8 
Q7T0Y2 

ID Q7T0Y2 PRELIMINARY; PRT; 500 AA. 

AC Q7T0Y2; 

DT 01-OCT-2003 (TrEMBLrel. 25, Created) 
DT 01-OCT-2003 (TrEMBLrel. 25, Last sequence update) 
DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 
DE Hypothetical protein. 

OS Xenopus laevis (African clawed frog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 



OC Amphibia; Batrachia; Anura; Mesobatrachia ; Pipoidea; Pipidae; 

OC Xenopodinae; Xenopus . 

OX NCBI_TaxID=8355; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Embryo; 

RX MEDLINE=22341132; PubMed=12454917 ; 

RA Klein S.L., Strausberg R.L., Wagner L., Pontius J., Clifton S.W., 

RA Richardson P.; 

RT "Genetic and genomic tools for Xenopus research: The NIH Xenopus 

RT initiative."; 

RL Dev. Dyn. 225:384-391(2002). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Embryo; 

RX MEDLINE=22388257; PubMed=12477932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H*, Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T . , Max S.I., Wang J., Hsieh F. , 

RA Diatchenko L., Marusina K. , Farmer A. A., Rubin G.M., Hong L., 

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E. f Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A. , Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J. f Myers R.M. , Butterfield Y.S., 

RA Krzywinski M.I., Skalska U. , Smailus D.E., Schnerch A. , Schein J.E., 

RA Jones S.J., Marra M.A. ; 

RT "Generation and initial analysis of more than 15 f 000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Embryo; 

RA Klein S., Strausberg R. ; 

RL Submitted (AUG-2003) to the EMBL/GenBank/DDB J databases. 

DR EMBL; BC055989; AAH55989.1; 

KW Hypothetical protein. 

SQ SEQUENCE 500 AA; 54722 MW; 10F16756CAFDCD0B CRC64; 

Query Match 63.0%; Score 1693; DB 13; Length 500; 

Best Local Similarity 63.6%; Pred. No. 1.4e-123; 

Matches 322; Conservative 74; Mismatches 100; Indels 10; Gaps 4 

Qy 13 LAQW L L RAAP E LAP AP FT L P L RVAAATN RWAP T P G P GT P AE RHAD GLALAL E P ALAS PA 72 

1:111 I I : I I I : I : I III:: I I I I : I 

Db 5 LVRLLLLCAAACASNKFIVPLNVSPAEIKGTLPV-APATPKDK — PGLLLA SDPG 56 

Qy 73 GAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHSYIDTYFDT 132 

I II : I I I I I I I I I I I I II I : I I I : I I I I : I I I I I I I I I I I I I I : I : : : I : I I : 

Db 57 GTINFFSMVDNLAGDSGRGYYLELLIGSPPQKVNILVDTGSSNFAVAGSPNPDVNTFFDS 116 



Qy 133 ERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLWIATIFESENFFLPGIK 192 

: I : : I : I : I I I : I I I I I I I I : I : I : I : I I I I I : I I : I I I : I I : I I : I I I I I 
Db 117 KLSTSYQSLNTEWVRYTQGSWTGLLGKDWSIPKGVNGTFLINIASIFQSESFFLPNIN 176 

Qy 193 WNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGSGTNGGSLVLG 252 

I I I I I I I I : I I I I I I I I : I I I I I I I I I I I : I I I I I I I I II I : I I I I I I I I 

Db 177 WQGILGLAYSTLAKPSSSVEPFFDSLVQQENIPDVFSMQMCGAGQSSPGNGINAGSLVLG 236 

Qy 253 GIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLP 312 

I : I I I I I I I : II I I I I I I I I II : I : I I I : II I I I I I I II : I I I I I I I I I I II I I I 

Db 237 GVEPSLYKGNIWYTPITEEWYYQVEVLKFEVGGQRLNLDCTVYNSDKAIVDSGTTLLRLP 296 

Qy 313 QKVFDAVVEAVARAS LI PEFS DGFWT GS QLACWTNS ET PWS YFPKI S I YLRDENS S RS FR 372 

I I I : I : I : I : : III I : II I Mill : : I I : I I I II I I I II I : II I I I 
Db 297 DKVFNAMVDAI VQT S LI QNFNAEFWAGLQLACWDKTQQPWNYFPDI SI YLRDTNTSRS FR 356 

Qy 373 ITILPQLYIQPMMGAGLNYECYRFGISPSTNALVT GATVMEGFYVIFDRAQKRVGFAASP 432 

: I : I I I I I I : : : I : I I I I I I : I I II I I I I I I I I I I I I I I : I I I I I I I 

Db 357 LTLKPQLYIQSVLTFQESLNCFRFGISQSASTLVIGATVMEGFYVIFDR7VEKRVGFAVSS 416 

Qy 433 CAE I AGAAVS EISGPFSTE D VAS N C VP AQ S L S E P I LW I VS Y ALM S VC GAI L L VL I VL L L L 4 92 

I I I : : I Nihil! I I I : I I I : I i I I : I I : I I : I I I : II II I I I : : I I I I 

Db 417 CAEVSGITVSEIAGPFGTSDVSSNCIARNPLREPIMWIISYSLMSLCGMILLVLVILLLL 476 

Qy 493 PFRCQRRPRDPEWNDESSLVRHRWK 518 

I : I I I : I I I I II I : I I I I 
Db 477 SNR— QRHDDMETINDESSLVQHRWK 500 



RESULT 9 
Q9R1P7 

ID Q9R1P7 PRELIMINARY; PRT; 255 AA. 

AC Q9R1P7; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Aspartyl protease (Fragment) . 

GN BACE2. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [11 

RP SEQUENCE FROM N.A. 

RA Accarino M. , Fumagalli P., Taramelli R. , Ottolenghi S.; 

RT "Cloning of a gene from chromosome 21 Down Region encoding a potential 

RT transmembrane protease."; 

RL Submitted (FEB-1998) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AF051150; AAD45964.1; 

DR MEROPS; A01.041; -. 

DR MGD; MGI: 1860440; Bace2 . 

DR GO; GO: 0004190; F: aspartic-type endopeptidase activity; IEA. 

DR GO; GO: 0008233; F:peptidase activity; IEA. 

DR GO; GO:0006508; P : proteolysis and peptidolysis ; IEA. 

DR InterPro; IPR001969; Aspprotease_AS . 

DR InterPro; IPR009007; Pept_A_acid. 

DR PROSITE; PS00141; ASP PROTEASE; 1. 



KW Protease. 

FT NON_TER 1 1 

SQ SEQUENCE 255 AA; 28685 MW; 53DE317815996D63 CRC64 ; 

Query Match 46.4%; Score 1246; DB 11; Length 255; 

Best Local Similarity 91.0%; Pred. No. 4e-89; 

Matches 232; Conservative 12; Mismatches 11; Indels 0; Gaps 0; 

Qy 264 WYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLRLPQKVFDAWEAV 323 

I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 WYTPIKEEWYYQIEILKLEIGGQNLNLDCREYNADKAIVDSGTTLLRLPQKVFDAWEAV 60 

Qy 324 ARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENSSRSFRITILPQLYIQP 383 

II I I I I I I I I I I I I I : I I I I I I I I I I I I : I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I 

Db 61 ARTSLI PEFSDGFWTGAQLACWTNSETPWAYFPKI S I YLRDENASRS FRITI LPQLYIQP 120 

Qy 384 MMGAGLN YEC Y RFG I S P S TNALVI GAT VMEG F YVI F DRAQ KRVG FAAS P CAE I AGAAVS E 443 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I : I I I I I I I II I I I III 

Db 121 MMGAG FNYECYRFGISSS TNALVI GAT VMEG FYWFDRAQ RRVG FAVS P CAE I E GT TVS E 180 

Qy 444 ISGPFSTEDVASNCVPAQSLSEPILWIVSYALMSVCGAILLVLIVLLLLPFRCQRRPRDP 503 

I I I I I I I I I : I I I I I I I I : I : I I I I II I I I I I I I I I I I I I I I II : I I I : I I : I I I I 

Db 181 ISGPFSTEDIASNCVPAQALNEPILWIVSYALMSVCGAILLVLILLLLVPLHCRHAPRDP 240 

Qy 504 EWNDES SLVRHRWK 518 

I I I I I I II I I I II I I 

Db 241 EWNDES SLVRHRWK 255 



RESULT 10 
Q8C7R1 

ID Q8C7R1 PRELIMINARY; PRT; 501 AA. 

AC Q8C7R1; 

DT 01-MAR-2003 (TrEMBLrel . 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Beta-site APP cleaving enzyme. 

GN BACE . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Spinal cord; 

RX MEDLINE-22354683; PubMed-124 66851 ; 

RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,770 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 

DR EMBL; AK049626; BAC33844.1; 

DR MGD; MGI: 134 6542; Bace. 

DR GO; GO:0004194; Frpepsin A activity; IEA. 

DR GO; GO:0006508; P : proteolysis and peptidolysis; IEA. 

DR InterPro; IPR001969; Aspprotease_AS . 

DR InterPro; IPR001461; Peptidase Al . 



DR Inter Pro; IPR009007; Pept_A_acid. 
DR Pfam; PF00026; asp; 1. 
DR PRINTS; PR00792; PEPSIN. 



DR 
SQ 


PROSITE; 
SEQUENCE 


PS00141; ASP PROTEASE; 1. 

501 AA; 55761 MW; B410DA8B64647663 CRC64; 




Query Match 44.1%; Score 1186; DB 11; Length 501; 
Best Local Similarity 46.2%; Pred. No. 5.4e-84; 

Matches 238; Conservative 82; Mismatches 169; Indels 26; Gaps 


7; 


Qy 


Q 


LLPLLAQWLLRAAPELAPAPFT LPLRVAAATNRWAPTPGPGTPAERHADGLALA 

: I 1 II : 1 1 1 1 1 1 1 1 II II : 

MAP AT HWT.T.T.WVGSGMLPAOGTHLGIRLPLRSGLA GPPLGLRLPRETDEES 


63 


UD 


X 


51 


Qy 


Oft 


LEPALAS PAGAANFIAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPH 

| : | : I II II : 1 1 1 : 1 1 1 : 1 1 11 = 111 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

FFPGRRGS FVEMVDN L RGK S GO G Y YVEMT I G S P P QT LN I LVDT G S S N FAVG AAP H 


123 


Dd 


to 
JZ 


106 


Qy 


19/1 


SYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFES 

: : | : : I I 1 1 1 1 1 II II 1 1 M 1 1 1 M 1 1 1 : 1 1 1 1 1 1 

PFT HRYYOROT.SSTYRDLRKGVYVPYTOGKWEGELGTDLVSIPHGPNVTVRANIAAITES 


183 


DD 


L U / 


166 


Qy 


1 Rd 

X. O *i 


EN F FL P G I KWN GIL GLAYAT LAK PSSSLETFFDS L VT QAN I PNVF SMQMC GAGL P V A 

: ||: | | I 1 II 1 1 1 1 : 1 : 1 1 1 1 1 1 1 1 1 1 1 : 1 M : 1 i : 1 : 1 1 1 1 1 : 

DKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHIPNIFSLQLCGAGFPLNQTE 


240 


UD 


1 67 


226 


Pitt 

Qy 




GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYN7\DKA 

: | | | : : : | | | : III 1 : 1 1 1 1 1 : MM— 1 : : : 1 1 M 1 : II : 1 II 1 1 : 

7VLASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKS 


300 


Db 


227 


286 


Qy 


301 


IVDSGTTLLRLPQIWFDAWEAVARASLIPEFSDGi^TGSQx^CWTNSETPWSYFPKISI 

MMIM llll:IM:l I-: 1 1 M MM 1 II II MM II IM 

IVDSGTTNLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISL 


360 


Db 


287 


346 


Qy 


361 


YLRDENS S RS FRI T I LPQL YI Q PMMGAGLN Y- EC YRFGI S P S TNALVI GATVMEGFYVI F 

|| | : : : | I 1 II 1 1 II M : 1 : : M 1 : 1 : 1 1 : M 1 1 M 1 1 II M 1 

YLMGEVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGTVMGAVIMEGFYWF 


419 


Db 


347 


406 


Qy 


420 


DRAQKRVGFAAS P CAE I AGAAVS EISGPFSTE DVAS N CVP AQ S L S E P I LW I VS YALMS VC 

II 1 : 1 1 : 1 1 1 1 1 : : 1 II 1 1 : 1 1 : : 1 : : : 1 

DRARKRI G FAVS ACHVHDE FRT AAVE G P FVT ADME D C G YN IPQTDEST LMT I AYVMAAI C 


479 


Db 


407 


466 


Qy 


480 


GAILLVLIVLLLLPFRCQR — RPRDPEWNDESSL 512 




Db 


467 


I : : : : 1 : : Ml 1 1 : : MM 

-ALFMLPLCLMVCQWRCLRCLRHQHDDFADDI SLL 500 





RESULT 11 
Q9ULS1 

ID Q9ULS1 PRELIMINARY; PRT; 532 AA. 

AC Q9ULS1; 

DT 01-MAY-2000 (TrEMBLrel. 13 r Created) 

DT 01-OCT-2001 (TrEMBLrel. 18, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein KIAA1149 (Fragment) . 

GN KIAA1149. 

OS Homo sapiens (Human) . 



OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=20039618; PubMed=105744 61 ; 

RA Hirosawa M. , Nagase T. , Ishikawa K., Kikuno R. , Nomura N., Ohara O.; 

RT "Characterization of cDNA clones selected by the GeneMark analysis 

RT from size-fractionated cDNA libraries from human brain."; 

RL DNA Res. 6:329-336(1999). 

CC -!- SIMILARITY: BELONGS TO PEPTIDASE FAMILY Al . 

DR EMBL; AB032975; BAA86463.2; -. 

DR HSSP; P56272; 1AM5. 

DR GO; GO: 0004194; F:pepsin A activity; IEA. 

DR GO; GO: 0008233; F:peptidase activity; IEA. 

DR GO; GO:0006508; P : proteolysis and peptidolysis; IEA. 

DR InterPro; IPR001969; Aspprotease_AS . 

DR InterPro; IPR001461; Peptidase_Al . 

DR InterPro; IPR009007; Pept_A_acid. 

DR Pfam; PF00026; asp; 1. 

DR PRINTS; PR00792; PEPSIN. 

DR PROSITE; PS00141; ASP_PROTEASE; 1. 

KW Hypothetical protein; Aspartyl protease; Hydrolase; Protease. 

FT NON_TER 1 1 

SQ SEQUENCE 532 AA; 58720 MW; 98B135D0D5FBD2E8 CRC64; 

Query Match 44.0%; Score 1183.5; DB 4; Length 532; 

Best Local Similarity 47.8%; Pred. No. 9.3e-84; 

Matches 231; Conservative 81; Mismatches 154; Indels 17; Gaps 6; 

44 APTPGPGTPAERHADGLALAL EPALAS PAGAANFLAMVDNLQGDSGRGYYLE 95 

| : || | || II I : I : I I II I : I I I : I I I : I 

52 APSTASGCPCAAAWGGAPLGLRLPRETDEEP — EEPGRRGSFVEMVDNLRGKSGQGYYVE 109 

ILI GT PPQKLQI LVDTGS SNFAVAGTPHS YI DT YFDTERS ST YRSKGFDVTVKYTQGSWT 155 
: I : I || I I I I I I I I I I I I I II:: I : : I I I I I I I I I II I 



Qy 


44 


Db 


52 


Qy 


96 


Db 


110 


Qy 


156 


Db 


170 


Qy 


216 


Db 


230 


Qy 


273 


Db 


290 


Qy 


333 


Db 


350 


Qy 


392 



| | | : I I I I : I I I I I I : I I : I I I I I I I I I I : I : I I I I I I 



| | I I | : : 11 : I I : I : I I I I I : I : I I I : : : I I I = III I MINI: I I 



||:: | : : : I I II I : I I : I I I I I : I I I I I I I I I I I : I I I : I I 



I I I I I I I I I I : I I II : I I I : : : I I I I I I I I I I : : I 



392 ECYRFGI S PSTNALVI GATVMEGFYVI FDRAQKRVGFAAS PCAEIAGAAVSEI SGP FSTE 451 
: | | : I || I : I : I I : I I I I I I : I I I I : I I : I I I I I : : I I I I 



Db 410 DCYKFAISQSSTGTVMGAVIMEGFYWFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTL 469 



Qy 452 DVASNCVPAQSLSEPILWIVSYALMSVCGAILLVXIVLLLLPFRCQR — RPRDPEWNDE 509 

I : I I : : | : : : I | : : : : I : : : | | | I : : : I 

Db 470 DMEDCGYNIPQTDESTLMTIAYVMAAIC-ALFMLPLCLMVCQWRCLRCLRQQHDDFADDI 528 



Qy 


510 SSL 512 






Db 


1 1 

529 SLL 531 






RESULT 12 






Q8BQY4 






ID 


Q8BQY4 PRELIMINARY; PRT; 501 AA. 






AC 


Q8BQY4; 






DT 


01-MAR-2003 (TrEMBLrel. 23, Created) 






DT 


01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 






DT 


01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 






DE 


Beta-site APP cleaving enzyme. 






GN 


BACE. 






OS 


Mus musculus (Mouse) . 






OC 


Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 


Euteleostomi; 


OC 


Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; 


Murinae; Mus. 


OX 


NCBI TaxID=10090; 






RN 


[1] 






RP 


SEQUENCE FROM N.A. 






RC 


STRAIN=C57BL/6J; TISSUE=Brain; 






RX 


MEDLINE=22 354683; PubMed-124 66851 ; 






RA 


The FANTOM Consortium, 






RA 


the RIKEN Genome Exploration Research Group Phase I 


& II 


Team; 


RT 


"Analysis of the mouse transcriptome based on functional 


annotation 


RT 


60, 770 full-length cDNAs . " ; 






RL 


Nature 420:563-573(2002). 






DR 


EMBL; AK046175; BAC32620.1; -. 






DR 


MGD; MGI: 1346542; Bace. 






DR 


GO; GO: 0004194; F:pepsin A activity; IEA. 






DR 


GO; GO:0006508; P : proteolysis and peptidolysis ; IEA. 






DR 


InterPro; IPR001969; Aspprotease_AS . 






DR 


InterPro; IPR001461; Peptidase Al . 






DR 


InterPro; IPR009007; Pept_A_acid. 






DR 


Pfam; PF00026; asp; 1. 






DR 


PRINTS; PR00792; PEPSIN. 






DR 


PROSITE; PS00141; ASP PROTEASE; 1. 






SQ 


SEQUENCE 501 AA; 55816 MW; C0855513145E024E CRC64; 





Query Match 44.0%; Score 1183; DB 11; Length 501; 

Best Local Similarity 46.0%; Pred. No. 9.2e-84; 

Matches 237; Conservative 83; Mismatches 169; Indels 26; Gaps 7 

Qy 9 LL P L LAQWL L RAAP E LAP AP FT LPLRV7WYTNRWAPTPGPGTPAERHADGLALA 63 

: I I II : I I I I I I I I II II : 

Db 1 MAPALHWLLLWVGSGMLPAQGTHLGIRLPLRSGLA GPPLGLRLPRETDEES 51 

Qy 64 LEPALAS PAGAAN FLAMVDNLQGDS GRGYYLEMLI GT P PQKLQI L VDTGS SNFAVAGT PH 123 

I : I : I I I I I : I I I : I I I : I I : I : I I I I I I I I I I I I I I I I II 

Db 52 E E P G RRG S FVEMVDN LRGK S GQ G Y YVEMT VG S P PQT LN I LVDT G S S N FAVGAAP H 106 



Qy 124 SYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFES 183 

: : I : : I I I I I I I I I I I I I : I I I I : I I I I : III I I I 

Db 107 PFLHRYYQRQLSSTYRDLRKGVWPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAITES 166 

Qy 184 ENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPV A 240 

: II: I I II I I I I I I : I : I III I II I I I I : I I I : I I : I : I I I I I : 

Db 167 DKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHIPNIFSLQLCGAGFPLNQTE 226 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

: I I I : : : I I I : III I : I I I I I : MM:: | : : : | | | | | : || : || I I I : 
Db 227 ALASVGGSMIIGGIDHSLYTGRLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKS 286 

Qy 301 IVDSGTTLLRLPQKVFDAVVEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

I II M I I M I I : I M : I I : : : II : I II I I I I I M I M : I I I I : 

Db 287 IVDSGTTNLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISL 346 

Qy 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNY-ECYRFGISPSTNALVIGATVMEGFYVIF 419 

I I I : : : I I I II I M I I : : I : : : I I : I : I I : I : I I : II II I I : I 

Db 34 7 YLMGEVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGTVMGAVIMEGFYWF 406 

Qy 42 0 DRAQ KRVG FAAS P CAE I AGAAVS EISGPFSTE DVAS N CVP AQ SLSEPILWI VS YALMS VC 47 9 

IIMMMIIII : : I II I M I I : : I : : : I 

Db 407 DRARKRIGFAVSACHVHDEFRTAAVEGPFVTADMEDCGYNIPQTDESTLMTIAYVMAAIC 466 

Qy 480 GAILLVLIVLLLLPFRCQR — RPRDPEWNDESSL 512 

I : : : : I : : Mil I : : Mil 

Db 467 -ALFMLPLCLMVCQWRCLRCLRHQHDDFADDISLL 500 



RESULT 13 
Q8IYC8 

ID Q8IYC8 PRELIMINARY; PRT; 501 AA. 

AC Q8IYC8; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Beta-site APP-cleaving enzyme. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Brain; 

RA Strausberg R. ; 

RL Submitted (JUL-2002) to the EMBL/ GenBank/DDB J databases. 

DR EMBL; BC036084; AAH36084.1; 

DR GO; GO: 0004194; F:pepsin A activity; IEA. 

DR GO; GO: 0006508; P : proteolysis and peptidolysis ; IEA. 

DR InterPro; IPR001969; Aspprotease_AS . 

DR InterPro; IPR001461; Peptidase_Al . 

DR InterPro; IPR009007; Pept_A_acid. 

DR Pfam; PF00026; asp; 1. 

DR PRINTS; PR00792; PEPSIN. 

DR PROSITE; PS00141; ASP_PROTEASE; 1. 

SQ SEQUENCE 501 AA; 55824 MW; 768595CF5517EFB7 CRC64; 



Query Match 43.6%; Score 1172.5; DB 4; Length 501; 

Best Local Similarity 46.1%; Pred. No. 6.1e-83; 

Matches 239; Conservative 82; Mismatches 165; Indels 33; Gaps 9 



Qy 7 AL L L P L LAQWL L RAAP ELAP AP FT LPLRVAAATNRWAPTPGPGTPAERHADGLA 61 

I III t : : I I I MM II II 

Db 2 AQALPWLLLWM GAGVLPAHGTQHGIRLPLRSGLG GAPL GLR 42 

Qy 62 LALE — PALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVA 119 

I I I : I: I M II M M : II I : I I : M M I I II I M M M I II 

Db 43 LPRETDEEPEEPGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFAVG 102 

Qy 120 GTPHSYIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIAT 17 9 

II:: I : : MM I I I I I I I I : I I I I : II I I : III 

Db 103 AAPHPFLHRYYQRQLFSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAA 162 

Qy 180 IFESENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPV 239 

I II: II: I I I I I II I I I : I : I III I I M II I : : II : II : I : I II I I : 

Db 163 ITESDKFFINGSWWEGILGLAYAEIARPDDSLEPFFDSLVKQTHVPNLFSLQLCGAGFPL 222 

Qy 240 AGS GTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYN 296 

I : II I : : : I I I : III I : I I II I : MM:: I : : : I I I [ : I I : | II 

Db 223 NQSEVLASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYN 282 

Qy 297 ADKAIVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFP 356 

I I : I M I M I I M I : I II : I I : : : II - I Mil I II II MM II 

Db 283 YDKSIVDSGTTNLRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFP 342 

Qy 357 KISIYLRDENSSRSFRITILPQLYIQPMMGAGLNY-ECYRFGISPSTNALVIGATVMEGF 415 

M : M I : : : M II I I I I I M : I : : : I I : I M I : I : I I : II II 

Db 343 VISLYLMGEVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAISQSSTGTVMGAVIMEGF 4 02 

Qy 416 YVI FD RAQKRVG FAAS P CAE I AGAAVS EISGPFSTE D VAS NC VPAQ S L S E P I LWI VS YAL 475 

I I : I I I I; I I : I I i I I : : | | I I I : I I : : I : 

Db 403 YWFDRARKRI GFAVSACHVHDEFRTAAVEGPFVTLDMEDCGYNI PQTDESTLMTIAYVM 462 

Qy 476 MSVCGAILLVLIVLLLLPFRCQR — RPRDPEWNDESSL 512 

: : I I : : : : I : : Ml I I : : Mil 
Db 463 AAIC-7\LFMLPLCLMVCQWRCLRCLRQQHDDFADDISLL 500 



RESULT 14 
Q8C4F4 

ID Q8C4F4 PRELIMINARY ; PRT; 4 67 AA. 

AC Q8C4F4; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Beta-site APP cleaving enzyme. 

GN BACE . 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Cerebellum; 



RX MEDLINE=22354683; PubMed=12466851 ; 

RA The FAN TOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,770 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 

DR EMBL; AK082317; BAC38462.1; -. 

DR MGD; MGI: 1346542; Bace. 

DR GO; GO:0004194; F: pepsin A activity; IEA. 

DR GO; GO: 0006508; P : proteolysis and peptidolysis ; IEA. 

DR InterPro; IPR001969; Aspprotease_AS . 

DR InterPro; IPR001461; Peptidase_Al . 

DR InterPro; IPR009007; Pept_A_acid. 

DR Pfam; PF00026; asp; 1. 

DR PRINTS; PR00792; PEPSIN . 

DR PROSITE; PS00141; ASP^PROTEASE; 1. 

SQ SEQUENCE 467 AA; 52063 MW; 31AB674FF1843652 CRC64; 

Query Match 39.0%; Score 1049; DB 11; Length 467; 

Best Local Similarity 41.7%; Pred. No. 2.4e~73; 

Matches 215; Conservative 77; Mismatches 163; Indels 60; Gaps 8; 

Qy 9 L L P L LAQWL L RAAP E LAP AP FT L P L RVAAATN RWAP T P G P GT PAE RHAD GLALA 63 

: ! I | I : I I I I I I I I i I II : 

Db 1 MAPALHWLLLWVGSGMLPAQGTHLGIRLPLRSGLA GPPLGLRLPRETDEES 51 

Qy 64 LEPALAS PAGAANFLAMVDNLQGDS GRGYYLEMLI GT PPQKLQI LVDTGS SN FAVAGT PH 123 

I : I : I I I I I : I I I : I II : II : I : II I I I I I I I I I I II II II 

Db 52 EEPGRRGS FVEMVDNLRGKS GQGYYVEMTVGS P PQTLNI LVDTGS SNFAVGAAPH 106 

Qy 124 S YI DT YFDTERS S T YRS KGFDVTVKYTQGSWTGFVGEDLVTI PKGFNTS FLVNIATI FES 183 

: : I : : I I II I I I I I II I I : I I I I : I I I I : Ml I I 

Db 107 PFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPHGPNVTVRANIAAITES 166 

Qy 184 ENFFLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPV A 240 

: II: I I I I I I I I I I : I : I III I I I I I I I : I I I : I I : I : I I I I I : 
Db 167 DKFFINGSNWEGILGLAYAEIARPDDSLEPFFDSLVKQTHIPNIFSLQLCGAGFPLNQTE 226 

Qy 241 GSGTNGGSLVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKA 300 

: I I I : : : II I : III I : I I M I : INI:: I : : : I I II I : I I : I 
Db 227 ALASVGGSMIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKE 280 

Qy 301 IVDSGTTLLRLPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISI 360 

: I I I I I I I I II I I I : M I I : 

Db 281 TEKFPDGFWLGEQLVCWQAGTTPWNIFPVISL 312 

Qy 361 YLRDENSSRSFRITILPQLYIQPMMGAGLNY-ECYRFGISPSTNALVIGATVMEGFYVIF 419 

II I : : : I I I I M I I I I : : I : : : M : I : I I : I : II : I II I I I : I 

Db 313 YLMGEVTNQS FRITI LPQQYLRPVEDVAT SQDDCYKFAVSQS STGTVMGAVIMEGFYWF 372 

Qy 420 D RAQ K RVG FAAS P CAE I AG AAVS EISGPFSTE D VA S N C VP AQ SLSEPILWIVS Y ALM S VC 479 

I I I : M : M I I I : : I M I I : I I : : I : : : I 

Db 37 3 DRARKRIGFAVSACHVHDEFRTAAVEGPFVTADMEDCGYNIPQTDESTLMTIAYVMAAIC 432 

Qy 480 GAILLVLIVLLLLPFRCQR — RPRDPEWNDESSL 512 

I : : : : I : : : I I I I : : : I I I 

Db 433 -ALFMLPLCLMVCQWRCLRCLRHQHDDFADDI SLL 4 66 



RESULT 15 
Q9CUU5 

ID Q9CUU5 PRELIMINARY; PRT; 267 AA. 

AC Q9CUU5; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Adult male brain cDNA, RIKEN full-length enriched library, 

DE clone : 3526402A15 product : beta-site APP cleaving enzyme, full insert 

DE sequence (Fragment) . 

GN BACE. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxI D=l 0090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE^Brain; 

RA Adachi J., Aizawa K. f Akahira S-, Akimura T., Arai A., Aono H. , 

RA Arakawa T., Bono H. , Carninci P., Fukuda S., Fukunishi Y., Furuno M 

RA Hanagaki T., Hara A., Hayatsu N., Hiramoto K. , Hiraoka T., Hori F., 

RA Imotani K. , Ishii Y., Itoh M., Izawa M. , Kasukawa T., Kato H., 

RA Kawai J., Kojima Y., Konno H., Kouda M. , Koya S., Kurihara C, 

RA Matsuyama T., Miyazaki A., Nishi K., Nomura K. , Numazaki R. , Ohno M 

RA Okazaki Y. , Okido T., Owa C, Saito H., Saito R. , Sakai C, Sakai K 

RA Sano H. , Sasaki D., Shibata K. , Shibata Y., Shinagawa A., Shiraki T 

RA Sogabe Y. , Suzuki H., Tagami M., Tagawa A., Takahashi F., Tanaka T. 

RA Tejima Y. , Toya T., Yamamura T., Yasunishi A., Yoshida K. , Yoshino 3 

RA Muramatsu M. , Hayashizaki Y. ; 

RL Submitted (JUL-2000) to the EMBL/ GenBank/DDB J databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN-C57BL/ 6J; TISSUE=Brain; 

RX MEDLINE-22354683; PubMed=12466851; 

RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation 

RT 60,770 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Brain; 

RX MEDLINE=21085660; PubMed=11217851 ; 

RA RIKEN FANTOM Consortium; 

RT "Functional annotation of a full-length mouse cDNA collection."; 

RL Nature 409:685-690(2001). 

RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Brain; 

RX MEDLINE=99279253; PubMed=1034 9636; 

RA Carninci P., Hayashizaki Y.; 

RT "High-efficiency full-length cDNA cloning."; 

RL Meth. Enzymol. 303:19-44(1999). 

RN [5] 

RP SEQUENCE FROM N.A. 



RC STRAIN=C57BL/6J; TISSUE=Brain; 

RX MEDLINE=20499374; PubMed-11042 159 ; 

RA Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M., 

RA Konno R\ , Okazaki Y., Muramatsu M. f Hayashizaki Y.; 

RT "Normalization and subtraction of cap-trapper-selected cDNAs to 

RT prepare full-length cDNA libraries for rapid discovery of new genes."; 

RL Genome Res. 10:1617-1630(2000). 

RN [6] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Brain; 

RX MEDLINE=2 0530913; PubMed=1107 6861; 

RA Shibata K., Itoh M. , Aizawa K. , Nagaoka S., Sasaki N. , Carninci P., 

RA Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M. , 

RA Sumi N., Ishii Y., Nakamura S., Hazama M. , Nishine T . , Harada A., 

RA Yamamoto R. , Matsumoto H., Sakaguchi S . , Ikegami T., Kashiwagi K., 

RA Fujiwake S., Inoue K. , Togawa Y. , Izawa M. , Ohara E., Watahiki M. , 

RA Yoneda Y. , Ishikawa T., Ozawa K. , Tanaka T. r Matsuura S,, Kawai J. , 

RA Okazaki Y., Muramatsu M. , Inoue Y w Kira A., Hayashizaki Y. ; 

RT "RIKEN integrated sequence analysis (RISA) system-384-format 

RT sequencing pipeline with 384 multicapillary sequencer. "; 

RL Genome Res. 10:1757-1771(2000). 

DR EMBL; AK014390; BAB29317.2; -. 

DR MGD; MGI : 1346542; Bace. 

DR GO; GO: 0004194; F:pepsin A activity; IEA. 

DR GO; GO:0006508; P : proteolysis and peptidolysis; IEA. 

DR InterPro; IPR001461; Peptidase_Al . 

DR InterPro; IPR009007; Pept_A_acid. 

DR Pfam; PF00026; asp; 1. 

FT NON_TER 1 1 

SQ SEQUENCE 267 AA; 30333 MW; 94 13EB4530AB63B0 CRC64; 

Query Match 24.3%; Score 653; DB 11; Length 267; 

Best Local Similarity 45.3%; Pred. No. 8.8e-43; 

Matches 121; Conservative 56; Mismatches 86; Indels 4; Gaps 3 

Qy 249 LVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTL 308 

: : : I I I : III I : I I I I I : MM:: I : : : II I I I : I I : I I I I I : I I I I I I I 
Db 1 MIIGGIDHSLYTGSLWYTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTTN 60 

Qy 309 LRLPQKVFDAVVEAVARASLIPEFSDGHWTGSQLACWTNSETPWSYFPKISIYLRDENSS 368 

I I I I : II I : I I : : : II : I I I I I I II I I I I I : I I I I : I I I : : 

Db 61 LRLPKKVFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMGEVTN 120 

Qy 369 RS FRI T I LPQLYI QPMMGAGLNY- EC YRFGI S P STNALVT GATVMEGFYVI FDRAQKRVG 427 

: M I 1 II I M I : : I : : : I I : I : I I : I : I I : I I I I I I : I I I I : I I : I 

Db 121 QSFRITILPQQYLRPVEDVATSQDDCYKFAVSQSSTGTVMGAVIMEGFYWFDRARKRIG 180 

Qy 428 FAAS P CAE I AGAAVS EISGPFSTE DVAS N CVP AQ S L S E P I LW I VS YALMS VCGAI L LVL I 487 

II I I : : I I I I I : I I : : I : : : I I : : : : 

Db 181 FAVSACHVHDEFRTAAVEGPFVTADMEDCGYNIPQTDESTLMTIAYVMAAIC-ALFMLPL 239 

Qy 488 VLLLLPFRCQR — RPRDPEWNDESSL 512 

I : : : I I I I : : : I I I 

Db 240 CLMVCQWRCLRCLRHQHDDFADDI SLL 2 66 



Search completed: February 28, 2004, 09:11:16 



Job time : 45.4848 sees 
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